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Preface 



With a standard program committee and a pre-review process, the Third In- 
ternational Workshop on Automated Deduction in Geometry (ADG 2000) held 
in Zurich, Switzerland, September 25-27, 2000 was made more formal than the 
previous ADG ’96 (Toulouse, September 1996) and ADG ’98 (Beijing, August 
1998). The workshop program featured two invited talks given by Ghristoph 
M. Hoffmann and Jurgen Bokowski, one open session talk by Wen-tsiin Wu, 
18 regular presentations, and 7 short communications, together with software 
demonstrations (see http://calfor.lip6.fr/~wang/ADG2000/). Some of the most 
recent and significant research developments on geometric deduction were re- 
ported and reviewed, and the workshop was well focused at a high scientific 
level. 

Fifteen contributions (out of the 18 regular presentations selected by the 
program committee from 31 submissions) and 2 invited papers were chosen for 
publication in these proceedings. These papers were all formally refereed and 
most of them underwent a double review-revision process. We hope that this 
volume meets the usual standard of international conference proceedings, repre- 
sents the current state of the art of ADG, and will become a valuable reference for 
researchers, practitioners, software engineers, educators, and students in many 
ADG-related areas from mathematics to GAGD and geometric modeling. 

ADG 2000 was hosted by the Department of Gomputer Science, ETH Zurich. 
We thank all the individuals, in particular external referees and members of the 
program committee, for their help with the organization of ADG 2000 and the 
preparation of this volume. The next workshop ADG 2002 will take place in 
Linz, Austria in September 2002. The proceedings of ADG ’96 and ADG ’98 
have been published as volumes 1360 and 1669 in the same series of Lecture 
Notes in Artificial Intelligence. 
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On Spatial Constraint Solving Approaches* 



Christoph M. Hoffmann and Bo Yuan 
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Purdue University 

West Lafayette, IN 47907-1398, USA 
{cmh,yuEui}@cs .purdue . edu 



Abstract. Simultaneous spatial constraint problems can be approached 
algebraically, geometrically, or constructively. We examine how each ap- 
proach performs, using several example problems, especially constraint 
problems involving lines. We also prove that there are at most 12 real 
tangents to four given spheres in R^. 



1 Introduction 

Spatial constraint solving involves decomposing the constraint schema into a 
collection of indecomposable subproblems, followed by a solution of those sub- 
problems. Good algorithms for decomposing constraint problems have appeared 
recently, including m- The best of those algorithms are completely general, 
adopting a generic degree-of-freedom reasoning approach that extends the older 
approach of searching for characteristic constraint patterns from a fixed reper- 
toire such as 0. 

In the spatial setting, even small irreducible problems give rise to nontrivial 
algebraic equation systems and yield a rich set of challenging problems. Restrict- 
ing to points and planes, prior work has succeeded in elucidating and solving 
with satisfactory results the class of octahedral problems. An octahedral prob- 
lem is an indecomposable constraint schema on six geometric entities, points 
and/or planes, with the constraint topology of an octahedron; see [ll7lll)j . Such 
problems have up to 16 real solutions. 

When lines are added as geometric primitives, even sequential problems be- 
come nontrivial, such as placing a single line at prescribed distances from four 
fixed points. In Q line problems have been investigated and solved using several 
homotopy continuation techniques in conjunction with algebraic simplification. 
In particular, the problem 3p3L was analyzed and solved in which three lines 
and three points are pairwise constrained in the topology of the complete graph 
Kq. In this paper, we consider the problems 4plL and 5plL of placing four or 
five points and one line by spatial constraints. We also contrast them to the 6p 
octahedral problem. Our main purpose is to learn how successful the different 
approaches to solving these problems are. 

* Work supported in part by NSF Grant CCR 99-02025, by ARO Contract 39136- MA, 
and by the Purdue Visualization Center. 



J. Richter-Gebert and D. Wang (Eds.): ADG 2000, LNAI 2061, pp. 1-|1^ 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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2 Three Ways to Solve Subproblems 

Once a subproblem has been identified, it must be translated into a simultaneous 
system of nonlinear equations, usually expressed algebraically. The system is 
then solved. Due to application considerations, we are especially interested in 
solution strategies that can, in principle, identify all real solutions of such a 
system. Thus we exclude in particular the usual Newton iteration approach that, 
beginning with a particular initial configuration, numerically determines at most 
one solution of the system. 

We are interested in three approaches to solving the algebraic equations that 
arise when evaluating a subproblem. 

1. Simplify the equations using a systematic set of techniques that are appro- 
priate for the problem. This is the approach taken in, e.g., fp. 

2. Apply a pragmatic mixture of geometric reasoning that simplifies the equa- 
tions, in conjunction with other algebraic manipulation. This approach has 
been taken in, e.g., | ^I7| . 

3. Adopt a procedural approach in which basic geometric reasoning results in 
a tractable, numerical procedure. This approach is familiar from, e.g., m 

In each case, the goal is to simplify the system so that it becomes tractable to 
evaluate all real solutions. Aside from the intrinsic repertoire of each of the three 
approaches, we note that the choice of a coordinate system in which to solve the 
system is of critical importance. 

We will explore how each of these approaches performs by considering the 
constraint subproblem in which 5 points and one line are to be placed subject to 
constraints on them. In Q, it was argued that a good choice of the coordinate 
system seeks to place the lines in a fixed position, formulating the equations on 
the points and on lines that could not be placed. We have found this to be a 
good idea as well. However, in the sequential line placing problem, we will see 
that it is better to place the points. 

In the following, we will consider three spatial irreducible constraint prob- 
lems: 

1. The 6p Octahedral Problem: Given six points in space and twelve pre- 
scribed distances between them in the topology of an octahedron, determine 
the six points relative to each other. This problem is related to the Stewart 
platform |S|. 

2. The 4plL Problem: Given four known points, find a line that lies at pre- 
scribed distance from each of them. Equivalently, find the common tangents 
to four fixed spheres Q. 

3. The 5plL Problem: Given one line and five points, and thirteen con- 
straints between them in the topology shown in Figure |3 determine their 
relative position. 

We will see that the first problem yields to the systematic simplification ap- 
proach. That it can be addressed with the second approach as well has been 
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shown in 0, among others. An especially nice solution using the Cayley-Menger 
determinant was presented by Michelucci in mu. 

The second problem is amenable to the algebraic approach as well, except 
that the coordinate system has to be chosen carefully. We will explain briefly 
two different choices and their consequences. 

Finally, the third constraint problem has not yielded to the first two con- 
straint solving approaches, and the only satisfactory approach we have found so 
far is the computational one. 



3 The Spatial Constraint Problems 



We explain each constraint problem we consider in turn, in increasing order of 
complexity. 



3.1 The 6p Octahedral Problem 

We are given six points and twelve distance constraints, as indicated in Figure m 
The position of the points relative to each other, or with respect to a global coor- 
dinate system, is not known. As noted in 0, this problem has several instances 




Fig. 1. The 6p Octahedral Problem: Graph vertices represent points, graph edges dis- 
tance constraints 



when replacing some of the points with planes and considering angle constraints 
between planes. In every case, the problem cannot be further decomposed and 
requires solving a simultaneous system of nonlinear equations. A solution is a 
coordinate assignment to the points that satisfies all twelve distance constraints. 
As we will explain, this problem yields to both the algebraic and to the reasoning 
approach. 
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3.2 4plL — Common Tangent to Four Spheres 

We are given four points in fixed location in a global coordinate system. We are 
asked to find a line in space that lies at prescribed distance from each of the four 
points. Equivalently, we are given four fixed spheres in 3-space, not necessarily 
of the same radius, and are asked to find a line that is tangent to each sphere. 

This problem is a sequential construction problem. A line has four indepen- 
dent coordinates, so four conditions such as the required distances determine its 
position. Suppose that we have a constraint problem in which each geometric 
element can be placed in a global coordinate system one-by-one in some order. 
If we admit as possible elements points, lines and planes, then this subproblem 
arises naturally. Note that geometric constraint problems that can be solved by 
such a sequential procedure are among the simplest problems. 

We will discuss an algebraic approach to solving this problem that relies on 
a good choice of the coordinate system. Geometric reasoning approaches appear 
to fail to lead to more simplification. 

3.3 The Problem 5plL 

Consider a configuration of five points and one line in 3-space that is constrained 
as shown in Figure El All constraints are distances. The subgraph of the five 




Fig. 2. The 5plL Problem: Graph vertices represent points and a line, graph edges 
distances. 

points has the topology of a square pyramid and is therefore not rigid. The point 
P 5 is the apex of the pyramid. In all, the configuration requires 19 generalized 
coordinates subject to 13 constraints, and is therefore generically a rigid body 
in 3-space. 
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4 Solving Strategies 

4.1 Algebraic Approach 

In the algebraic approach, we choose a coordinate system and formulate a set of 
algebraic equations based on that choice. The equations are then simplified and 
brought into a form that gives greater insight into the number of distinct solu- 
tions and is sufficiently simple that root finding or reliable numerical techniques 
can be applied to solve the system. Ideally, the approach follows a systematic 
generic framework for manipulating the equations. 



The 6p — Octahedral Problem The octahedral problem 6p has an elegant 
solution discovered by Michelucci m that is based on the Cayley-Menger deter- 
minant. Recall that the determinant relates the squared distances between five 
points in space. Consider the two unknown diagonal distances c?i 3 = d{pi^p^) 
and d ,24 = d{p 2 ,PA)- Choosing the five points {pi,P 2 ,P 3 ,PA,P 5 \, a quadratic 
relationship between d\^ and c ?|4 is obtained from the determinant. A similar 
relationship is obtained from the set {pi,p 2 ,P 3 ,P 4 ,P 6 }- Thus, we obtain two 
quartic equations in two unknowns, a system of total degree 16. 

Michelucci’s solution is independent of a coordinate system choice, a strong 
point, but it does not follow a systematic procedure. A systematic framework 
was developed by Durand in m Choosing to place one point at the origin, 
one point on the x-axis, and one point in the positive quadrant of the a:?/-plane, 
the initial system consists of nine quadratic equations in nine unknowns. This 
system is then simplified by the following steps: 

1. Gaussian elimination. 

2. Solving univariate equations. 

3. Parameterization of variables in bilinear and biquadratic equations. 

The resulting system for 6p are three quartic equations in three variables, a 
system of total degree 64. By applying techniques from homotopy continuation, 
the final system required evaluating only 16 roots, of which, in the examples 
studied, 8 were real and 8 were complex. 



4plL — Tangent to Four Spheres The problem would appear to be classi- 
cal, but we did not find much helpful literature on it. A systematic algebraic 
treatment of the problem was given by Durand in • Durand found an equation 
system of degree 64 (the BKK bound) and experimentally determined that 40 of 
the 64 paths led to infinity. Thus, only 24 paths had to be explored. We improve 
this result now. 

Placing three points at the origin, on the x-axis, and in the xy-plane, our 
initial equation system consists of six quadratic equations in six unknowns, (ID" 
EJ. The unknowns are the point (x, y, z) nearest to the origin on the sought line, 
and the unit length tangent {u,v,w) of the line. Assume that is the distance 
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of point i from the line, and that the point coordinates are (oi, 6^, Ci). Then the 
initial equation system is 



+ y^ + -rl = 0 (1) 

(o2 - a;)^ + - (o2w)^ - r| = 0 (2) 

(o3 - xf + (63 - - {asu + bsv)"^ - = 0 (3) 

(04 — x)'^ + (64 — y)^ + (c4 — z)^ — {a^u + b^v + c^w)^ — = 0 (4) 

XU + yv + zw = 0 (5) 

- 1 = 0 ( 6 ) 

We use equation dQ to eliminate the terms x^ , y^ and z^ from equations (0- 
0. Then those equations can be solved symbolically, yielding a solution that 
expresses the variables x,y and z as a quadratic expression in u,z; and w. This 
eliminates x,y and z from equations 6 and 6 and factors out a subsystem 
of three equations in u,v,w of degree 2, 3 and 4, respectively. Thus, a degree 
reduction to 24 has been accomplished. 

We note that for each solution (x, y, z, u, v, w) of the system (x, y, z, —u, —v, 
—w) is also a solution^ Geometrically, this says that the orientation of the lines 
is immaterial, which one expects. Therefore, the 24 solutions of the system, 
counted by Bezout’s theorem, reduce to 12 geometric solutions. That this is the 
smallest number possible follows from the result by Theobald et al. [Rj. They 
prove there are up to 12 distinct real tangents when all radii are equal, that is, 
when ri = r2 = T3 = T4 = r. 

It would seem that one could place the unknown line on the x-axis and seek 
equations to place the four points as a rigid structure subject to the distance con- 
straints. Doing so yields equations with a high degree of symmetry and structure, 
but we have not found an attractive simplification of those equations. 



5pLl We can choose a coordinate system in which the line L is on the x-axis 
and the point on the z-axis as shown in Figure 0 We denote the distance 
between L and the point pi with r^, i = 1,...,5. The distance between and 
point p5 and pi, i = 1, ... ,4, is denoted di, and the distance between points pi 
and pj with dij. This choice leads to a system consisting of 12 equations in 12 
unknowns: 



yf + zf = rf t=l,...,4 

Xi + yf + (Zi - rs)2 = d^i i = 1 , . . . , 4 (7) 

(xi - Xj)2 -k {yi - yjY + (zi - Zj-y = dfj ij = 12,23,34,41 

Naive counting of the number of possible solutions would yield 4096. Using the 
multi-homogeneous Bezout theorem of im. a tighter bound of 512 finite solutions 

^ This is clearly true for the original system. Moreover, the expressions substitnted for 
X, y and z also exhibit the sign symmetry; hence the claim is true for the resulting 
system of degree 24. 
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is obtained. That bound does not make it practical to explore all solutions. 
Moreover, the system of equations resisted meaningful simplification, both ad- 
hoc manipulations as well as the systematic simplification steps developed before. 

We could choose to place the coordinate system such that three points are 
put into a special position, say one point at the origin, one on the a:-axis, and 
one in the a;y-plane, but doing so did not lead to better equations. 



4.2 Geometric Reasoning to Assist Simplification 

In this approach we try to introduce auxiliary geometric structures, such as 
the curves described by a particular point when restricting to a subset of the 
constraints, especially if this can lead to a reasonable parameterization. Often, 
one can then introduce the additional constraints and derive a simpler equation 
system. 



6p — Octahedron Geometric reasoning was used in jZ] to yield a system of 
equations that, in conjunction with resultant techniques, succeeded in deriving 
a univariate polynomial of degree 16. It improves on the systematic approach by 
a factor of 4 and matches the Cayley-Menger solution. 



4plL — Sphere Tangents Presently, we do not have a good solution that 
exploits the geometry of the configuration. We believe that it should be possible 
to find one of total degree to 24 or less. 



5plL Placing the coordinate system as before, with the line on the cc-axis and 
the point on the z-axis, we could proceed by parameterizing the locus of the 
point Pi as function of the z-coordinate Z. From the distance constraints ri and 
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di we obtain for the point 

{ Xi{t) = ±^d\ -rl~rl + 2rgt 
Pi= I yi{t) = ±\/rl - (8) 

[ zi{t) = t 

We can then construct the remaining points whose coordinates are now a func- 
tion of the parameter t, using the distance constraints for T2, ^2, and d\2 for p2, 
the distance constraints r4, <^4 and ^41 for p4. Finally, point ps is constructed us- 
ing rs, da and ^23- This leaves the distance constraint d^A to be used to determine 
the parameter t. The equations so derived have the following form: 

—idlxi{t)x2 + 8xi{t)yi{t)x2y2 ~ 8r5Xi{t)zi{t)x2 

- 8 r 5 Ui{t)zi{t)y 2 + 4 xi{t)'^xl + -I- 8r‘^xi{t)x2 

-Mlxi{t)x2 + Ml2Xi{t)x2 - 4:dlyi{t)y2 + 8r|yi(t)?/2 - 4dfyi(t)y2 

+4d?22/i(i)y2 + Mlr5Zi{t) - 4df 2^52^1 (i) - ^r^zi{t)yl + 4r|zi(t)2 

-8rfzi(t) -h 4zi(t)2y2 _g 4^2y2 _ 4^2^^ (-^^2 ^ 

-zi{t)xl + r^xl - 2r^xi{t)x2 - 2.r^yi{t)y2 + 2r|zi(t) 

+dlzi{t) - rjzi{t) - r|zi(t) = D2 
-4dlxi(t)xA + 8xi{t)yi{t)x4yA ~ 8r5Xi{t)zi{t)xA - Sr^yiit) zi{t)yA 
-|- 4 a;i(t)^a ;4 -I- 4 yi(t)^j /4 -|- 8r|a:i(t)a;4 — 4 c? 4 a;i(t)x 4 -I- 4^44X1 (t)a;4 
-4djyi{t)yA + 8r|?/i(t)j/4 - 4dlyi{t)yA + 4dlj^yi{t)yA + Mlr^ziit) 
-Adlj^r^ziit) - 8rs2;i(t)j/| -h 4 r|zi(t )2 - 8r|zi(t) 

' + 4 zi{t)^yj + 4 r\y\ - 4 r\zi(tY + Sr^rjziit) = D3 

-Zi{t)xl + r^x\ - 2rsa;i(t)a;4 - 2 r^y\(t)yA + 2 r|zi(t) -I- d4Zi(t) 

-r|zi(t) - rjziit) = Da 

-l2zi{t)x2X3 + 12x5X2X3 - 4r5Xi{t)x2 - 4r5yi{t)y2 + 4r|zi(t) 
-|-4r5Zi(t)z3 — 4r|z3 -|- 2d|zi(t) -|- 2d|zi(t) 

+2dl3Zi{t) - 4 r|zi(<) = £>5 

4zi(t)a;3X4 - 4r5X3CC4 -h 4zi(t)y3y4 - ^r^ysyA - 4xi(t)xAZ3 
-4yi{t)yAZ3 + 2d\z3 + 2d\zz - 2d\^Z3 + 4r5Xi{t)xA + 4r5yi(t)y4 

— 4r5Zi(t) — 2 d§zi(t) — 2 d\ziit) + 2d\4^Zi(t) + 4 r|zi(t) = Dq 

yl + 4 = D7 

xl - 2 rsZ 3 = Ds 

(9) 

where Z?i, D2, £3, Da, D5, Dq are constants. The system is unattractive. 



4.3 Construction by Computation 

The closed-form algebraic expressions for the point coordinates of the 5plL prob- 
lem that were obtained by the geometric reasoning described before, do not seem 
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to be simple enough to lead to further massive algebraic simplification. However, 
they are very easy to evaluate computationally, and can be used to define numer- 
ically a curve in a 2D coordinate space defined by the parameter and the distance 
^ 34 - When the curve is intersected with the nominal distance line, the real solu- 
tions are obtained. As illustrated in Figure 0 Pio is on line L and pioPi -L L, the 
angle between pioPi and the xy-plane is 9. We use 9 as parameter to calculate 
point Pi : 




( Xi{9) = ±\/dl - rf - r| -I- 2nr5 sin(0) 

Pi= I yi(d) = ri cos{9) (10) 

[ zi{9) = ri sin(6») 

For practical purposes, the approach is satisfactory, since it gives a system- 
atic, and sufficiently simple, procedure to find all real solutions. Moreover, the 
solutions so found can be further refined with other numerical processes, since 
they provide good starting points. From a theoretical perspective, the draw-back 
of the procedural approach is its inability to produce, with certainty, a bound on 
the number of solutions. Here are the details for our 5plL problem, and several 
example solutions. 

P 2 can be solved using the constraints dist{L,p 2 ) = V 2 , dist{p^,p 2 ) = ^2 
and dist{pi,p 2 ) = di 2 - As illustrated in Figure El the point Ps = (Xs,ys,Zs) in 
triangle Apip 2 Ps is on the line pipE and P 2 ps -L PiPE- So, we have 



s = \PsP5\ 



jdl + dl-dl,) 

2di 



h = \PsP2\ = \Jdl~ s'^ 



s 

Ps=P5 + 

di 



{Pi 



Pb) 



We obtain 
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P2 




Fig. 5. Triangle P1P2P5 



Consider the vector w = • We define a plane II through the point 

Ps perpendicular to w. Since dist{L,p2) = T2, the point p2 is on the cylinder 
S : y'^ + z'^ = r 2 whose axis is the line L and whose radius is T2 ■ Let pc be the 
interaction point of line L and plane II, then p^ = (xc, -Zc) where 



yc = 0 
Zc = 0 



Using the vectors 



U = V X w 

we set up a local coordinate system: {o' ,x' ,y' , z'), where 

o' = Pc 
x' = u 
y' = v 
z' = w 



The matrix transform from the global coordinate system (o, x, y, z) to the local 
system {o' ,x' ,y' , z') is 



■ Wx 


Uy 


Uz 


0- 




-1 


0 


0 


-Xc' 


Vx 


Vy 


Vz 


0 




0 


1 


0 


-Vc 


Wx 


Wy 


Wz 


0 




0 


0 


1 


-Zc 


. 0 


0 


0 


1. 




.0 


0 


0 


1 . 



( 11 ) 



Figure El illustrates the local coordinate system {o' ,x' ,y' , z') situated in the 
global system (o, x, y, z). 

From the construction process we know that the point p2 lies on a circle 
in the plane II with radius h. Let p'^ = Mps, in the local coordinate system 
{o' ,x' ,y' , z'). Then the equation of the circle is 



(x' - x' + {y' - y',f = 



(12) 
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Fig. 6. Local Coordinate System [o' ,x' ,y' , z') 

Now the vector along line L is L : (1, 0, 0), the angle between L and w is /3, and 
the intersection of plane II and cylinder E is an ellipse on plane II. In the local 
system {o' ,x' ,y\ z'), the ellipse equation is 

,2 ,2 

+ ^ = 1 113) 

r2CSC^(/3) 

Solving equations (O and (d simultaneously, we get p '2 and from it, in turn, 
P2- 

P2 = {x2{0),y2{9),Z2{0)) 

Note that there are up to 4 real solutions for p 2 - 

Similarly, we compute pi from its constraints with the line L and the points 
Pi and P 5 . Finally, we compute ps from the constraints with line L and points 
P 2 and P5 

P3 = {X3{9),y3{O),Z3{0)) 

Pi = {xi{9),yi{e),Zi{e)) 

^34(0) is a complicated curve in a coordinate space defined by the parameter 9 
and the distance dist{p 3 { 9 ),pi{ 9 )). The curve would be hard to express symbol- 
ically. However, we can trace it numerically. 

Given a step length d9, we calculate ^34(0) for every step 9 = 9 + d9, and 
so obtain the curve Cg : d 3 i{ 9 ) — 9 numerically. Let the absolute error of <^34(0) 
and the nominal distance line (^34 be 

p{9) = \d3i{9) - d3i\ 

Obviously, the smaller p{9) is, the nearer 0 is to a real solution of the 5pLl 
problem. Call a point (0, ^34(0)) a coarse solution if 9 satisfies 

d3i{9) < S 
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Table 1. An Constraint Set of the 5plL Problem 



ri 


5.12863551744133 


V2 


3.4797204504532 


ra 


5.12009033478805 


T4 


4.48866237372967 


r& 


0.854823450422681 


di 


5.40391247291482 


d.2 


4.92751853999451 


da 


6.556901760918 


di 


5.04776146732994 


di2 


2.49916074098941 


d2a 


9.55687124240852 


dai 


9.15 


dii 


7.1858882412183 



for a chosen tolerance 6. The coarse solution set Ss is then 

Ss = {qg = (0,d34:{9))\p{6) < S,qg G Cg} 

S is the threshold of the coarse solutions, and the size of diminishes with 
S. The coarse solutions can be further refined with Newton-Raphson iteration 
since they provide good starting points. 

4.4 An Example 

Table □ gives an example of constraint set of 5pLl problem, by defining d9 = 
1.0°, Figure 0 gives the discrete curve. In our example, if <5 = 0.1, |5c| = 108, 
if i5 = 0.2, |S'c| = 224. When 6 = 0.1 we can get 20 refined real solutions; when 
S = 0.2 we can get 24 refined real solutions; when <5 > 0.2 we have more than 224 
coarse solutions but the refined real solution number is still 24. Therefore, the 
maximum real solution number of the example is 24. The circles on the nominal 
distance line in Figure 0 represent the real solutions. Table 0 gives all the 24 
real solutions of this example. 

The computation was carried out using a tolerance-driven search for potential 
solutions followed by a Newton iteration refining the initial values. On a PC with 
a 500MHz Pentium 3 the initial search took 100 milliseconds with a tolerance 
of 0.2, and the subsequent refinement took an additional 233 ms. This contrasts 
favorably with the computation times obtained by Durand on a Sun SPARC 20 
using homotopy continuation where 24 paths were evaluated in approximately 
30 sec. The homotopy evaluation on the slower machine was a completely general 
implementation, while our computation of the solution was specifically designed 
for this particular problem. It would be interesting to test this problem on general 
multi-variate interval Newton solvers. 
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Fig. 7. 6 ( 34 ( 0 ) — 9 Curve 



5 Further Discussion 

The Construction by Computation approach can be used more generally. Let 
F{X) = 0 be a system of n nonlinear equations F = {/i, •■•,/«} with n 
unknowns X = {a;i, . . . , x„}. To find all real solutions of F{X) = 0, we can 
choose a real parameter set T = {t\, . . . ,tk}k<n such that X can be solved as 
X{T) = {x\{T), . . . ,Xn{T)} by using n — k equations 

Fn-k = {f\f €F,l<i<n-k}cF 

Let 

Fk = F- F^_u = {P\r ^ F„_k, P (^F,l<j <k}cF 

and define 

Let domain of T be I?t = [timin,timax] x ••• x [tfcmm, tfcmax], and for every 
ti G T define a step size dti such that we can calculate p{T) on Dt numerically 
for every [ti = ti + dti] x ■ ■ ■ x [tk = tk + dtk]- Obviously, T x p{T) C 5?^+^ 
is a hypersurface. Given a small positive real number S, we can get the Coarse 
Solution Set 



= {qr = {T,p{T))\p{T) <S,qTGTx p{T)} 
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Table 2. Real Solution Set of the Example 





Pi 


P2 


P3 


Pa 


1 


(2.06,4.98,1.21) 


(3.30,3.46,-0.34) 


(-4.87,2.43,4.51) 


(3.45, -1.36, 4.28) 


2 


(-2.06,4.98,1.21) 


(-3.30,3.46,-0.34) 


(4.87, 2.43, 4.51) 


^3.45, -1.36,4.28) 


3 


(2.28,4.81,1.77) 


(3.85,2.88,1.96) 


(-2.79,1.73,-4.82) 


(-3.46,1.30,4.30) 


4 


(-2.28, 4.81,1.77) 


(-3.85,2.88,1.96) 


(2.79,1.73,-4.82) 


(3.46,1.30,4.30) 


5 


(2.93,3.48,3.77) 


(3.72,3.18,1.42) 


(-4.95,1.29,4.95) 


(-0.37,3.65,-2.61) 


6 


(-2.93,3.48,3.77) 


(-3.72,3.18,1.42) 


(4.95,1.29,4.95) 


(0.37,3.65,-2.61) 


7 


(3.04,3.04,4.13) 


(4.14,0.94,3.35) 


(-4.93,-1.65,4.85) 


(-1.58,4.32,-1.23) 


8 


(-3.04,3.04,4.13) 


(-4.14,0.94,3.35) 


(4.93,-1.65,4.85) 


(1.58,4.32,-1.23) 


9 


(3.26,1.39,4.94) 


(4.01,2.20,2.70) 


(-3.22,3.90,-3.32) 


(-3.47,-1.07,4.36) 


10 


(-3.26,1.39,4.94) 


(-4.01,2.20,2.70) 


(3.22,3.90,-3.32) 


(3.47,-1.07,4.36) 


11 


(3.29,0.79,5.07) 


(4.15,-0.84,3.38) 


(-4.92,1.82,4.79) 


(1.72,4.39,-0.95) 


12 


(-3.29,0.79,5.07) 


(-4.15,-0.84,3.38) 


(4.92,1.82,4.79) 


(-1.72,4.39,-0.95) 


13 


(3.29,-0.79,5.07) 


(4.15,0.84,3.38) 


(4.92,-1.82,4.79) 


(1.72,-4.39,-0.95) 


14 


(-3.29,-0.79,5.07) 


(-4.15,0.84,3.38) 


(4.92,-1.82,4.79) 


(-1.72,-4.39,-0.95) 


15 


(3.26,-1.39,4.94) 


(4.01,-2.20,2.70) 


(-3.22,-3.90,-3.32) 


(-3.47,1.07,4.36) 


16 


(-3.26,-1.39,4.94) 


(-4.01,-2.20,2.70) 


(3.22,-3.90,-3.32) 


(3.47,1.07,4.36) 


17 


(3.04,-3.04,4.13) 


(4.14, -0.94,3.35) 


(-4.93,1.65,4.85) 


(-1.58,-4.32,-1.23) 


18 


(-3.04, -3.04,4.13) 


(-4.14,-0.94,3.35) 


(4.93,1.65,4.85) 


(1.58,-4.32,-1.23) 


19 


(2.93,-3.48,3.77) 


(3.72,-3.18,1.42) 


(-4.95,-1.29,4.95) 


(-0.37,-3.65,-2.61) 


20 


(-2.93,-3.48,3.77) 


(-3.72,-3.18,1.42) 


(4.95,-1.29,4.95) 


(0.37,-3.65,-2.61) 


21 


(2.06,-4.98,1.21) 


(3.30,-3.46,-0.34) 


(-4.87,-2.43,4.51) 


(3.45,1.36,4.28) 


22 


(-2.06,-4.98,1.21) 


(-3.30,-3.46,-0.34) 


(4.87,-2.43,4.51) 


(-3.45,1.36,4.28) 


23 


(2.28,-4.81,1.77) 


(3.85,-2.88,1.96) 


(-2.79,-1.73,-4.82) 


(-3.46,-1.30,4.30) 


24 


(-2.28,-4.81,1.77) 


(-3.85,-2.88,1.96) 


(2.79,-1.73,-4.82) 


(3.46,-1.30,4.30) 



For every qx G Sc we can get an starting point X^. Using Newton-Raphson 
iteration, we may refine the starting point to a real solution of F{X) = 0. After 
calculating all qx G S'c we can get the real solution set Sr- If the step sizes 
dti,i = are small enough and S is large enough, we can find all real 

solutions of F(A) = 0. 
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Abstract. We propose an algorithm for solving geometric constraint 
problems. The algorithm has linear complexity for constrained systems 
without loops, and is of quadratic complexity for constraint problems 
with loops. This algorithm is complete for constraint problems about 
simple polygons. The key of the algorithm is to combine the idea of graph 
based methods for geometric constraint solving and geometric transfor- 
mations from rule-based methods. 



1 Introduction 



Geometric constraint solving (GGS) is the central topic in much of the current 
work of developing parametric and intelligent GAD systems. It also has applica- 
tions in mechanical engineering, linkage design, computer vision and computer 
aided instruction. There are four main approaches to GGS: the graph analy- 
sis approach llbl 211511^ . the rule-based approach I lldll 411 1 1 1 71911 8l . the numer- 
ical computation approach mrm. and the symbolic computation approach 
pimj . In practice, most people use a combination of these approaches to get the 
best result. 



The basic idea of the graph analysis approach is to represent the geometric 
constraints with a graph and then use algorithms from graph theory to transform 
the drawing problem into certain constructive forms, like constructive forms with 
ruler and compass. In m, graph decomposition algorithms are used to divide 
a constraint problem into three parts, to solve the parts separately, and then to 
assemble the three parts together to form the final diagram. In |21, geometric 
diagrams are divided into clusters and clusters are merged into larger clusters 
until a single cluster is obtained. In US], graph methods are used to decide 
whether the the constraint problems are well-, under- or over-constrained. In 
particular, Hoffmann et al. presented an algorithm to decompose any constraint 
problem into constructible sub-graphs, called dense sub-graphs |E| ■ In most cases. 



* This work is supported in part by a “973” project and by CNSF under an outstanding 
youth grant (NO. 69725002). 



J. Richter-Gebert and D. Wang (Eds.): ADG 2000, LNAI 2061, pp. 16-|2^ 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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this decomposition provides a computational procedure for the corresponding 
GCS problem. 

However, the GCS problem is not solved perfectly. The basic steps of graph 
analysis methods are first decomposing a large constraint problem into smaller 
ones and then assembling them together. In this process, we need to solve systems 
of simultaneous equations. In the 2D case, if the maximal number of equations 
to be solved is equal to or less than two, then the problem is called a constraint 
problem without loops. Otherwise, it is called a constraint problem with loops. 
For constraint problems with loops, we need to solve a simultaneous equation 
system, which could be a difficult task. Therefore, it is still worth to find fast 
methods which may transfer constraint problems with loops to constraint prob- 
lems without loops. 

In this paper, we present a graph analysis method of GCS. For a constraint 
problem without loops, the method has linear complexity and can be used to 
solve problems involving geometric objects and geometric constraints of any 
degree of freedom (DOF). So the algorithm works for both 2D and 3D cases. 
Also, the method works for well and under constraint problems. For a constraint 
problem with loops, the method has quadratic complexity and is complete for 
constraint problems of simple polygons. Generally speaking the graph analysis 
methods have lower complexities and the rule-based approaches can solve more 
types of loops by using geometric transformations. The key idea of our method 
is to combine these two approaches to obtain a fast and powerful loop breaking 
method. 

The rest of the paper is organized as follows. In Section 2, we will present 
the graph representation for the three geometric transformations. In Section 3, 
we will present the algorithm. In Section 4, we will present the conclusion. 

2 Graph Representation of Geometric Transformations 

2.1 Graph Representation of Constraint Problems 

A constraint problem can be represented by a weighted graph P). The graph 
vertices represent the geometric objects, edges represent geometric constraints. 
The weight of a vertex is the DOF of the represented object. The weight of an 
edge represents the DOF eliminated by the represented constraint. For instance. 
Fig. 2 is the graph representation for the constraint problem in Fig. 1. Note that 
the weights for the vertices and edges are not marked explicitly. In 2D case, most 
constraints have one DOF and most objects have two DOFs. 

For the convenience of complexity analysis, an adjacent list representation 
of graphs is used. Fig. 3 is the representation of the graph in Fig. 2, where the 
nodes in the first column are called head nodes. 

2.2 Geometric Transformations 

Geometric transformations are used to solve constraint problems with loops in 
rule based methods mm . Here, we will give the graph representation for three 
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Fig. 1. Lengths of four edges 
and angle {L^yLi) are given 




Fig. 2. The graph represen- 
tation 




Fig. 3. Adjacent list representation of graphs 



geometric transformations, which will be used to solve all constraint problems 
about simple polygons. 

— Rigid Body Transformation. 

If there exists a well-constrained sub-graph which can be solved sequentially 
by ruler and compass, then this sub-graph represents a rigid body in the 
original constrained diagram. In certain cases, we may represent this rigid 
body by a simpler geometric object. 

The following special form of this transformation is especially useful in 
solving constraint problems about polygons. Constraint sets like {|^B| = 
di, /-ABC = a,\BC\ = c? 2 }, = d\,\BC\ = d 2 ,\CA\ = d^} and 

{/ABC = a,\BC\ = d 2 ,/ACB = (3} are called called sas, sss, and asa 
constraints respectively. A series of sas, sss or asa constraints may determine 
a rigid body which can be replaced by a segment if only the two ending 
points are connected to other geometric objects. Such an example is given 
in Fig. 6, which can be solved by using two sas transformations. 

We give a precise description for the sas transformation below. Descrip- 
tions for other transformations are omitted. For an object v, let LDEG(v) 
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Fig. 4. Before the sas trans- 
formation 



Fig. 5. After the sas trans- 
formation 



and PDEG(?;) be the numbers of lines and points connected to v in the 
constraint graph respectively. We need to find a sub-graph satisfying the 
following conditions (Fig. 4 ). 

• \PqPi\, \PiP2\ and ZP0AP2 are known. 

• LDEG(Pi) = 2 , PDEG(Pi) = 2 . 

• PDEG(Zi) = 2 , PBEGih) = 2 . 

Then we may delete P\,li,l2 and add a new line = P0P2 and a new 
constraint IP0P2I. Note that all other constraints about l\^l2 are angle con- 
straints and can be converted to constraints about 




Fig. 6. A problem Fig. 7 . A problem 
using the sas trans- using the angle trans- 
formation formation 



Fig. 8. A problem 
using the parallel 
transformation 



— Angle Transformation. Angle transformations can be used to solve many 
constraint problems with loops. In Fig. 7 , since three inner angles of the 
quadrilateral are known, its fourth inner angle at point Pi is also known. 
With this angle known, the sas transformation can be used to solve the 
constraint problem. 

In general, we may use the concept of full-angles m for angle transforma- 
tion. We use [Li,Lj] to represent the full-angle from line Li to line Lj, which 
satisfies the following properties 



[Li, Lj] — [Lj,Li], 

[Li,Lj] = [Li,Lk] -\- [LkyLj], 
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With the above property, we may easily divide the lines into equivalent 
classes such that the angle between any two lines in the same class is known 
or can be calculated. It is clear that we may find all such equivalent classes 
in linear time of the number of geometric objects and geometric constraints. 

— Parallel Transformation. The problem in Fig. 1 cannot be solved with the 
above transformations, neither by Hoffmann and Owen’s graph triangle de- 
composition methods. The methods reported in [RTTI] may solve this prob- 
lem. But in these methods, we need to solve equation systems with more 
than two equations. In other words, the loop in this problem is not broken. 
In f,'II I T\ . the loop in this problem can be broken by introducing auxiliary 
points. The solution is given in Fig. 8, where line segment P 2 P 3 is translated 
to P\P 2 - Since IP 4 P 1 P 2 is the same as the angle formed by lines P 1 P 4 and 
P2P3, we may solve the problem with an sas transformation. 



In general, the parallel transformation works as follows. Let Pq, Pi, . . . , Pk 
be points in a constraint problem such that 

1. Angle [PoPi,Pk-iPk] is known. 

2. iPo-Pil, IP 1 P 2 I, ■ • ■ , \Pk-iPk\ are known. 

3. The only geometric objects connecting Pi,i = — l are points 

Pi_i,Pi+i and lines P^Pi+i. 

4. There are no points on lines P^P^+i, i = 0, . . . , fc — 1 besides Pi and Pi+i. 

Fig. 9 gives an example for fc = 4, where the dotted lines mean that they 
may not exist. If the above conditions are met, we may make parallelograms 



by P 2 , . . . , Pfe-i' in the original problem to obtain a new constraint problem 
(Fig. 9, middle). It is clear that 





Fig. 9. Parallel transformation 
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1. All the constraints about P2, ■ ■ ■ ,Pk-i can be converted to constraints 

about P 2 , . . . , In particular, IPqPiP^ is known. 

2 . If P21 ■ ■ • I Pk-i are constructed, we may construct P2, ■ • ■ , Pk-i by com- 
puting the intersections of lines. 

Furthermore, we may do an sas transformation for Pq^Pi^P^ to obtain the 
final result of the parallel transformation (Fig. 9, right). 

Suppose that the above Pi have been found. Then we need 0{n + e) steps 
to do the parallel transformation. 

3 The Algorithm and Applications 

3.1 The Algorithm 

For any vertex v in the graph, let DEG(u) be the sum of the weights of all the 
edges connecting to this vertex. Then, a vertex can be determined explicitly by 
the constraints involving it if the following condition is satisfied: 

DEG(u) < DOF(u). (1) 

The basic idea of the algorithm is to repeatedly remove those vertices satisfying 
this condition. The problem has no loops if and only if all the vertices of the 
graph can be removed in this way. If the problem has loops, we will use one of 
the three transformations to reduce the problem to a smaller one. 

1. Generate the adjacent list representation for the constraint graph and DEG(u) 
for each head node v. 

2. Do the angle transformation to find all the equivalent classes Li, . . . , such 
that the angle between two lines in the same class can be computed. 

3. Let V point to the first head node. 

4. If V is not an empty node then goto the next step. Otherwise we are at the 
end of the graph. There are two cases. If all objects are removed from the 
adjacent list, the problem can be solved by the algorithm, because each time 
we remove an object from the list, we actually give a construction method 
for it. If there are still nodes in the list, we cannot solve the problem. 

5. Remove those nodes v satisfying du from the graph and for each node w in 
the adjacent liet of v, minus one from DEG(zc) . To ensure that the complexity 
of this step is linear, we will find the next node satisfying dU from the 
neighboring nodes of u. If all nodes are removed in this way, goto Step 4. 
Otherwise, goto the next step. 

6. If u is a line satisfying 

— There are only two points Pq and Pi on it and 
— |PoPi| is known, 

do the next step. Otherwise set v to be the next head node and goto Step 4. 

7. Staring from Pi (or Pq), find points P 2 , ■ ■ ■ ,Pk satisfying the conditions of 
the parallel transformation. 

8. If A: = 1, nothing need to be done. Let v to be the next head node and go 
back to Step 4. 
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9. If fc = 2, we may do an sas transformation to Pq, Pi,P 2 - Put the new intro- 
duced line P 0 P 2 into the equivalent class of PqPi and also at the end of the 
adjacent list. Let v be the next head node and go to Step 4. 

10. Do a parallel transformation to Pq, P\, . . . , Pk and update the constraint 
graph. Put the new introduced line P 0 P 2 into the equivalent class of PqPi 
and also at the end of the adjacent list. Let v to be the next head node and 
go to Step 4. 

Let n be the number of geometric objects and e the number of constraints in 
the problem. The main loop of the algorithm starts from Step 4. There are four 
possibilities for each loop. (1) Do nothing. (2) Remove several nodes in Step 4. (3) 
Do an sas transformation. (4) Do a parallel transformation. In cases (2), (3) and 
(4), the number of geometric objects in the adjacent list will strictly decrease. 
Therefore, the maximal number of this loop is the number of the geometric 
object, n. 

We will show that the complexity of each loop is 0{n + e). Also, Steps 1 and 
2 have complexity 0(n -I- e). Therefore, the complexity of the algorithm would 
be 0(n^ -|- ne). In Step 5, we may eliminate at most n nodes. For each node 
eliminated, we need to search its adjacent list. Let di,i = 1, ... ,n be the steps 
needed to eliminate the i-th node. Then di is actually propotional to the number 
of constraints involving the i-th node plus a constant. Then the complexity of 
this step is ~ + ®)- complexity of Step 5 is (e). In Step 6, we 

need to find a path PqPi ■ ■ ■ Pk in the graph, each I < i < fc — 1 does not 
connect to other points. We may do this in 0(n -|- e) steps. The sas and parallel 
transformations need 0{n + e) steps. Therefore, each loop will finish in O(n-l-e) 
steps. 

From the above analysis, if a constraint problem has no loops, then it can be 
solved by the above algorithm in linear time. The quadratic complexity is mainly 
due to the fact that no backtracking is needed in Step 4 of the algorithm. We 
need to show that all possible sas and parallel transformations will be executed 
in the algorithm. This is because after each sas or parallel transformation, two 
equivalent classes for lines will not be merged. Therefore, if we cannot do sas 
and parallel transformations for a line before another transformation, we cannot 
do that after the transformation. 

3.2 Constraint Problems for Simple Polygons 

We will show that the algorithm reported in Section 3.1 can be used to solve all 
constraint problems for simple polygons. Let Pq, Pi, . . . , P„_i be n points in 
the plane. The constraint problem for simple polygon with Pi as vertices consists 
of two types of constraints only: 

1. Edge constraints, i.e., |PiPi_|_i| is known, where the subscripts are understood 
to be mod n. 

2. Angle constraints, i.e., the angle formed by P^Pi+i and PjPj+i is known. 

In jl /) . it is pointed out that all constraint problems about simple polygons can 
be solved by the parallelogram rules. This method is based on deductive rules 
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and has no complexity analysis. Our algorithm is a graph based theory and is of 
quadratic complexity. 

We first consider well-constrained problems for simple polygons which have 
2n — 3 constraints. We divide the problem into three cases. 

1. The problem has n edge constraints and n — 3 angle constraints. Since the 
lengths of all edges are known, we may construct the diagram with sas and 
parallel transformations. 

2. The problem has n — 1 edge constrains and n — 2 angle constraints. Since 
the length of one edge is unknown, parallel transformations may be used to 
each pair of the edges. After a series of parallel and sas transformations, the 
problem becomes one of the following cases. 

(a) Both of the two unknown angles involving the edge whose length is 
unknown. Fig. 10(a). This is a problem without loops. 

(b) One of the two unknown angles involving the edge whose length is un- 
known, Fig. 10(b). This problem may be solved with one sas transfor- 
mation. 

(c) The lengths for all the edges involving the two unknown angles are 
known. Fig. 10(c). This problem may be solved as follows. By an an- 
gle transformation, the angle formed by h and I 2 is known. Now, this 
problem can be solved by a parallel transformation. Notice that this 
problem can also be solved by the methods in mm- 






Fig. 10. Constraint problems for simple polygons 

3. The problem has n — 2 edge constraints and n — 1 angle constraints. Us- 
ing angle transformations, angles formed by any two edges are known. The 
lengths of two edges are unknown. These two edges divide the vertices into 
two sets such that two vertices in the same set can be connected with a series 
of edges with known lengths. Therefore, we may use parallel transformations 
to the two sets. By a series of parallel and sas transformations, the problem 
becomes one of the following cases. 

(a) The two edges with unknown lengths have a common end point. Fig. 
11(a). This is a problem without loops. 

(b) The two edges with unknown lengths have no common end point. Fig. 
11(b). This problem can be solved similar to Fig. 9 by a parallel trans- 
formation about li and I 2 since the angle between them are known. Note 
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that in the current case, 1^1^21 in Fig. 9 is not known. But we may still 
use the parallel transformation to solve it. We may add this diagram as 
a special case to our general algorithm. This problem can also be solved 
by the methods in mrm . 





Fig. 11. Constraint problems for simple polygons 



For an under-constrained problem, we may first transform it into a well- 
constrained problem by adding a proper number of constraints ca, and then 
solve it with the method presented in this paper. 

4 Conclusion 

The algorithm proposed in this paper uses a hybrid method by combining the 
graph analysis approach and the rule-based approach. As a result, it has the 
advantage of both approaches: it is fast and capable of solving a large number of 
constraint problems with loops. Actually, it is complete for all constraint prob- 
lems of simple polygons. We believe that this approach has more potential due 
to the fact that there should exist many techniques of geometric constructions 
in the classical geometry, which are useful to GCS. One possible extension is to 
introduce more transformations such that the algorithm may solve all the prob- 
lems within the scope of Hoffmann-Owen’s triangle decomposition approaches 
and is still of quadratic complexity. 
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Abstract. Following the work of Gonzalez- Vega, this paper is devoted 
to showing how to use recent algorithmic tools of computational real 
algebraic geometry to solve the Birkhoff Interpolation Problem. We recall 
and partly improve two algorithms to find at least one point in each 
connected component of a real algebraic set dehned by a single equation 
or a system of polynomial equations, both based on the computation of 
the critical points of a distance function. 

These algorithms are used to solve the Birkhoff Interpolation Problem in 
a case which was known as an open problem. The solution is available at 
the U.R.L.: http://www-calfor.lip6.fr/~safey/applications.html. 



1 Introduction 

The problem of interpolating a function / : R — > R by a univariate polynomial 
from the values of / and some of its derivatives on a set of sample points is one 
of the main questions in Numerical Analysis and Approximation Theory. 

Let X = {x \, . . . , x„} be a set of real numbers such that xi < ■ ■ ■ < Xn, r an 
integer, and let I C {1, . . . , n} x {0, . . . , r} be the set of pairs (i,j) such that the 
value fij = f^^\xi) is known. The problem of determining the existence and 
uniqueness of a polynomial Q in R[X] of degree bounded by r such that 

is called the Birkhoff Interpolation Problem. 

In jl 7j . Gonzalez- Vega focuses on determining, for fixed integers n and r, 
the family of I’s for which this question is solvable for any choice of x and the 
values fij. To this end, he shows that the problem can be reduced to deciding 
if some hypersurfaces contain real points with non-zero coordinates. In HD the 
cases n = 2, r £ {1, . . . , 6}, n = 3, r £ {1, 2, 3} and n = 4, r £ {1, . . . , 4} are 
solved, using techniques adapted from the Cylindrical Algebraic Decomposition. 



J. Richter-Gebert and D. Wang (Eds.): ADG 2000, LNAI 2061, pp. 26-|4^ 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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In 1998, the case n = 5 and r = 4 was presented as an open problem in CHI. 
The aim of the paper is to show how we have solved this case. 

The most popular algorithm deciding the emptiness of semi-algebraic sets — 
as a particular case of deciding the truth of a first order formula — is Collins’ 
Cylindrical Algebraic Decomposition (CAD) |1 211 1| whose complexity is doubly 
exponential in the number of variables in terms of basic arithmetic operations 
and size of output. 

In Grigoriev and Vorobjov’s paper m appeared new algorithms, based on 
the critical point method. These algorithms have a single exponential complexity 
in the number of variables, in terms of basic arithmetic operations and size of 
output. Still, in poi, Hong shows that the algorithms proposed in the papers 
m and PEI are not usable in practice. According to the experiments in 
the same conclusions apply for more recent methods given in jbldPIlDj . These 
algorithms adopt strategies of the following kind: 



— In the first place, solve the problem in favorable cases, such as a compact 
and smooth variety. 

— Get back from general situations to the favorable cases using various tricks, 
such as infinitesimal deformations or sums of squares. 

The papers of the TERA group m treat the case of a smooth complete inter- 
section variety, and propose an algorithm based on evaluation techniques, whose 
complexity is polynomial in terms of intrinsic real degrees. Still, these favorable 
situations, in particular compactness, are not easily detectable, and systemati- 
cally applying the tricks above makes the computations difficult in practice. 

In the papers I2SEI, two algorithms inspired by the ideas in |ibll9lj4l9liI0i are 
proposed. Both of them are based on the computation of the critical points of a 
distance function (thus avoiding the hypothesis of compactness) and improve the 
aforementioned algorithms. The first algorithm m computes at least one point 
in each connected component of a real algebraic set defined by a single equation. 
The second applies to a real algebraic set defined by a polynomial system, 
in the spirit of unng. The experiments in j'2?Slti) show that these strategies 
are competitive with the CAD on small examples and allow to deal with more 
significant ones, unreachable with the CAD. 

In this paper, we pursue the investigations of m by analyzing the practical 
behavior of these two recent algorithms on the Birkhoff Interpolation Problem. In 
sections0 2]and|Hl we recall the algorithm given in m, our contribution being a 
new way to solve a system with infinitesimal parameters arising in its course, and 
then the algorithm given in |Bj. We conclude this paper with our experimental 
results, which solve the case n = 5, r = 4 of Birkhoff’s problem. This gives us 
the opportunity to compare the size of the outputs and computational times of 
both algorithms. 

Throughout this paper, the base field is the rational field Q. The algorithms 
presented here generalize to the case of an ordered field K, replacing the field M 
by the real closure of K and the complex field C by its algebraic closure. 
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2 The Birkhoff Interpolation Problem 

2.1 Formulation 

We want to determine the sets I for which the Birkhoff Problem Interpolation 
admits a unique solution for all choices of x and of the fij. To this end, we 
follow closely and adopt its convenient matricial formulation. 

Consider the matrix £ = (cij) with n rows and r + 1 columns Y) . filled 
with O’s and I’s, such that = 1 if and only if (i, j) £ X. The problem admits 
a solution only if £ has as many I’s as columns. This amounts to saying that 
the coefficients of the interpolating polynomial Q are solution of a linear square 
system, with associated matrix Mg. This matrix is parametrized by x &nd its 
shape depends on £. We are interested in determining the matrices £ for which 
the determinant of Mg is non-zero for all Xi in which case the matrix £ is said 
to be poised. 

Example 1 . Let n = 4 and r = 3 and consider the matrix: 

/l 1 0 0\ 
f = 0 0 1 0 

\0 1 0 0/ 

Let Q{x) = Qq + aix + a2x'^ + be the generic polynomial of degree 3. Writing 

Q^^\xi) = fij if and only if = 1, we obtain the following linear system: 

{ Oq + a\Xi + Q2 x\ + asxf = /ip 
oi -I- 202 X 1 -I- 303X1 = /ip 
Oi -I- 202X3 -I- 3 o 3 x| = / 3,2 
2o2 -I- 603X2 = h,2 

whose matrix is: 

/ 1 Xi xf xf \ 

0 1 2xi 3xf 
0 1 2x3 3x§ 

^^0 0 2 6x2 / 

The interpolation problem is solvable if and only if 

12x3X2 + 6x1 — 12x2X1 — 6x3 

does not vanish for all values xi,X2,X3 satisfying xi < X2 < X3. 

In , Gonzalez- Vega shows that the question can be reduced to testing if a 
particular factor of the determinant of the matrix M.g has real roots with non- 
zero coordinates. Replacing (xi, . . . , x„) by (xi, Xi + t\, . . . ,x\+t\ + . . . + t^_i) 
yields a homogeneous polynomial in (ti, . . . , Letting = 1, we are brought 
to test if a hypersurface defined by a polynomial P G K[t2, ■ ■ • ,tn-i] has real 
roots with non-zero coordinates. 
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2.2 Sketch of the Resolution 

In order to determine all the poised matrices in the case n = 5 and r = 4, we 
have to study quasi-algebraic sets defined by a unique equation in M[t 2 , ta, ^ 4 ] and 
several inequations. While algorithms deciding the emptiness of real algebraic 
sets have known recent significant progress PSEBEI, more algorithmic work is 
necessary in the semi-algebraic case |2|. 

We thus treat this question using and adapting the algorithms for the al- 
gebraic case described in m and jOj. The algorithm described in |2H1 will be 
named Algorithm 1. It takes as input a single polynomial equation and returns 
at least one point in each connected component of the real algebraic variety de- 
fined by the equation. The algorithm described in 0 will be named Algorithm 
2. It takes as input a polynomial system of equations and returns at least one 
point in each connected component of the real algebraic variety defined by the 
system. 

To solve our problem, given a polynomial P in we adopt the 

following scheme. 

First Step. We study the hypersurface defined by P = 0, using either Algo- 
rithm 1 or Algorithm 2. If this subroutine returns no real point, or a real point 
with non-zero coordinates, we can give a positive (resp. negative) answer to the 
Interpolation Problem. 

Second Step. We are in the case when all the real points we obtained have at 
least one coordinate equal to zero. Using Algorithm 1, we study the hypersurface 
defined by + {Ttit 2 ts — 1)^ = 0; when Algorithm 2 we study the polynomial 
system P = 0 and Ttit 2 t 3 — 1 = 0. 

Note that in some situations, we can avoid such extra computations, using 
for example the following result: 

Lemma 1. Let P G K[ti , . ■ . ,tn] and V(P) the hypersurface defined by P = 0. 
Let M = (^ 1 , . . . , p,n) G V(P) nK" such that gradM(P) yf 0, and L C {1, . . . , n} 
the set of indexes for which pi is zero. //grad]yj(P) is coUinear to none of the 
axes {ti)i^i, then there exists a point M' in V(P)nK" with non-zero coordinates. 

3 Preliminary Results 

This section describes the basics of the critical point method, valid for both 
Algorithm 1 and Algorithm 2. 

Let P be a square-free polynomial in Q[xi, . . . , a;„], and V(P) C C” the 
complex variety defined by P = 0. Our strategy to compute at least one point in 
each connected component of V(P) nK” relies on the computation of the critical 
points on V(P) 0 K” of a “distance function”. Given a point A = (oi, . . . ,a„) 
in K", the function c?a : M ||AM|p admits a minimum on each connected 
component of V(P) fl K". These minima are solutions of the system 

5(P, A) = {P(M) = 0, gradM(P)//AM}, 
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where the condition grad]y[(P)//AM is expressed by setting the determinants 
to zero. The set of all complex roots of this system is denoted by C{P, A). 

Two cases will be distinguished, according to the dimension of the singular 
locus of V(P). The following result from m deals with the first case, where 
there is a finite number of singularities: 

Theorem 1. Let P be a square-free polynomial in Q[xi, . . . , Xn]- If V(P) eon- 
tains a finite number of singular points, there exists a Zariski-dense set T' C C" 
such that for A in T' , the system S{P,A) is zero -dimensional. 

Moreover, ifV{P) is a smooth hypersurfaee, there exists a Zariski-dense set 
T C C” such that for A in T, the system S{P,A) is zero-dimensional and 
radical. 

Hence, if V(P) contains a finite number of singular points, one can choose 
a point A such that S{P,A) is zero-dimensional. Since S{P,A) intersects each 
semi-algebraically connected component of V(P)P|M", the problem is reduced 
to isolate the real roots of S{P, A). 

Throughout this paper, we will represent the solutions of a zero-dimensional 
system with coefficients in a field k using primitive element techniques, in a way 
that can be traced back to Kronecker | 22 |. Such a representation consists in: 

— a linear form u = ^ UiXi which separates the zeros of the systerr0; 

— its minimal polynomial q in k[t] and the parameterizations (ui,...,u„) in 



k[t]^, where degu^ < degg, such that the zeros of the system are described 

by 



where q{t) is the derivative of the square-free part of q{t). We will denote 
this kind of representation (u, TZ), where TZ is the vector [q,vi, . . . , u„]. 

Modern presentations and algorithms include the Rational Univariate Rep- 
resentation or the Geometric Resolution 1 1 41 1 0| . which coincide in the 

case of radical ideals of dimension zero. Such representations are useful, in that 
they allow to count and isolate the real roots of the system using for instance 
Sturm-Habicht sequences m- 

When the singular locus of V(P) is of positive dimension, difficulties arise, as 
the system S{P, A) is also of positive dimension for any choice of the point A. In 
the following sections, we focus on this case, and present in turn the strategies 
used in Algorithms 1 and 2. 

— The algorithm described in m performs an infinitesimal deformation on the 
hypersurface V(P) to get back to a smooth situation; the required informa- 
tion is extracted from the solution of the deformed problem. 

— The approach from j0| is based on the iterated study of singular loci, as 
varieties of lower dimension. 

^ The image of two distinct zeros by u are distinct. 



q{t) = 0, 
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4 Algorithm 1: Using Infinitesimal Deformations 

In the first subsection, we recall the main steps of the algorithm in to which 
we refer for further details. We then present our solution to the specific sub- 
problem of computing a univariate representation depending on the deformation 
parameter. 



4.1 Overview of the Algorithm 

Let £ be an infinitesimal; we denote by C(£) the algebraically closed field of 
algebraic Puiseux series with coefficients in C. The sub-ring of elements of non- 
negative valuation (called “bounded elements”) is naturally equipped with the 
operation denoted lime_>o. If C is a set of elements of C(e), lime_>oC denotes the 
set of the limits of the bounded of elements of C. Finally, if x is ^ 

C(£), where iq G Z, g G Q, G C, we denote by o{x) the rational number io/q. 

The following result m shows that the study of S{P — e, A) enables to solve 
the original problem. 

Proposition 1. The set (lim£_>o C(P — £, A)) fl M" intersects each connected 
component ofV{P) flM". 

Since V{P — e) C C(£)" is smooth. Theorem Q] implies that for a generic 
choice of the point A, the system S{P — £, A) is radical of dimension zero. In 
this case, its solutions can be described by a univariate representation (it, 72.), 
with coefficients in Q(e). The paper m then gives an algorithm to compute the 
limits of the bounded solutions it describes, when m is a well separating element. 
This is the case when: 



— for all a G C(P — £, A), o{u{a)) = min(o(Ai(a)), z = 1, . . . , n), 

— for all (a,/?) G C{P — £, A)^, u{a) and u{f3) are infinitesimally close if and 
only if a and (3 are infinitesimally close. 

Indeed, from a univariate representation associated to a well separating element 

{ g(e,t)xi = vi{e,t), 

\ 

q{e,t)xn = Vn{e,t), 



if gig| ... g™ is the square-free decomposition of g(0, 7), then, Vg G {!,..., m}, 
one can compute polynomials and such that the limits of the bounded 
solutions are represented by 



/ 

qj{t) = 0 , 

V 



^^\0,t)xi = v[^\o,t), ^ 



We can now give the first algorithm, which is the synthesis of all the previous 
points: 
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Algorithm 1 

Input: A squarefree polynomial P 

Output: At least one point on each connected component of V{P) 

1. Find by trial and error a point A such that S{P — £, A) is zero-dimensional 
and radical. 

2. Compute a parametric resolution {u,TZ) of its roots, with coefficients in Q(e). 

3. If u is a well-separating element for S{P — e, A), compute the limits of the 
bounded solutions described by 7?. as e 0. 

4. else change u, check if it is a separating element for S{P — e, A) and return to 
step 3. 



We detail a solution to point 1 above. In I2H1 the authors have proved the 
following results: 

Lemma 2. If G is a Grobner basis of the system S{P—e,A) in Q[e, xi, . . . , x„], 
for a block- ordering such that [e] < [xi, . . . , x„], then G is a non-reduced Grobner 
basis of S{P — e, A) in Q(e)[xi, . . . , x„]. 

If there exists a value £q in Z which doesn’t cancel any of the leading coeffi- 
cients of the polynomials in G, and such that the system S{P — £q, A) is radical 
of dimension zero, then S{P — e, A) is radical of dimension zero. Moreover, for 
such £o o,nd A, ifu is a separating element for S(P—£q, A) then u is a separating 
element for S{P — e, A). 

Given any point A that fits the hypothesis of Theorem P one can show that 
only a finite number of Eq will not fit the conditions of Lemma El Hence, a 
simultaneous search by trial and error of A S Z” and £o S Z can be performed 
to obtain a point A such that S{P — e, A) is radical. For a given A, this requires 
to compute the basis G; testing that S{P — Eq, A) is radical can be done using 
Hermite’s quadratic form (see |2ti] for example). 

4.2 Computing a Parametric Resolution 

We now turn to the second of the tasks mentioned above, namely computing a 
resolution parametrized by e. In Schost proposes a probabilistic algorithm 
to do so, based on the work of the TERA group The algorithm relies on 

a formal Newton approximation process, which is an analog of numerical root- 
finding techniques, and reminiscent of Hensel lifting methods. In the sequel, we 
first recall the main steps of this algorithm, then provide solutions to certify its 
output in our case (radical and zero-dimensional ideal). 

Overview of the Algorithm. For a random choice of Eq in Q, given any resolution 
{u,TZo) of the system ^(P — eo, A), there exists a resolution {u,TZ) of the system 
S{P — e, A) with coefficients in Q(s) whose specialization at Eq is {u,TZo)- The 
strategy presented in consists in approximating the solution (u, IZ) starting 
from (u, IZq). 
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The output 72. can be rewritten in terms of e' = e — Sq, as a vector 72' of 
polynomials in Q(e')[t], where none of the denominators of the expressions in 
e' vanishes at zero. Denote by 72' the vector of polynomials of Q[[e']][7], where 
all coefficients are expanded at precision 2*. The initial value is obtained by 
solving the system S{P — eo, A); the formal Newton operator, denoted Lift(72') 
computes 72'^^ from the argument 72'. 

The whole algorithm is organized around a while loop. Each pass begins by 
computing the resolution 72'^^! of precision 2®“''^. The RationalReconstruction 
subroutine then computes Fade approximants of all the coefficients, as ra- 
tional functions in e', with numerators and denominators of degree at most 2®; a 
boolean value b indicates success. If the reconstruction is possible, the subroutine 
StopCriterion, detailed below, tests if the resolution, rewritten in terms of e, 
is correct. If this is not the case, we go trough another loop. 

A Certified Result. The probabilistic aspect of the algorithm in ISB] is twofold: it 
lies in the choice of the specialization value Eq, and in the test StopCriterion. 
We provide here certified versions of these subroutines. 

The value Eq must satisfy some genericity conditions: the system S{P—Eo, A) 
must be zero-dimensional, its roots must be simple and in maximal number. The 
algorithm in m is based on the fact that all choices of Eq but a finite number 
fulfill these conditions, and that a bound on the number of bad values is readily 
available. Instead, we use the following result: if Eq cancels none of the leading 
coefficients of the polynomials in the basis G computed above, and if the system 
S(P — £o, A) is radical, then Eq satisfies the genericity conditions. 

Finally, to check that a solution (u = ^ UiXi,7Z) is the correct solution, it 
is enough to check that the minimal polynomial and the parameterizations in 
72, with t evaluated at reduce to zero modulo the basis G. This implies 

that the resolution 72 describes a set of points containing C(P — e, A). As these 
two sets have the same cardinality, we are done. 

The output is a list of polynomials in Q(e)[t]. The degree in t, the degrees in 
E of the numerators and the denominators of the coefficients are bounded by the 
Bezout number d”, where d is the degree of the polynomial P IBBl- The “size” 
of the output is thus 0{n{d+ 1)^") elements of the base-field Q. 

The details of the computations done in Newton’s operator are given in 
Following the usual numeric Newton operator, they rely on the evaluation of the 
vector Jac(S'(P — £, A))~^S{P — e, A) in suitable quotient rings, where Jac(S') 
denotes the Jacobian matrix of the system S. 
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Computing the parametric resolution 

Input: a point A such that the roots of the system S{P — e, A) are simple. 

a Grobner basis G of S(P — e, A) in Q(e)[a;i, . . . , Xn]- 
Output: a parametric resolution (m, TZ) of S{P — e, A) 

1. Find by trial and error a random rational number £o such that the system 
S(P — £o, A) is zero-dimensional and has simple roots, in maximal number. 

2. Compute a univariate representation TZ'q of the roots of S(P — £o, A) 

3. Use the Newton-Hensel lifting process to compute the successive approxima- 
tions TZ': 

i •«— 0; finished -f- false 
while not finished do 
TZ'i+i ^ Lift(7^') 

b,7Z' RationalReconstruction(7?.i_|_i) 
if b then 

TZ Substitute e' by e -I- eq in TZ' 
finished •<— StopCriterion(7?.) 
end if 
i ■<— i -I- 1 
end while 

4. return (u, TZ) 



As in jZlISEl], we use the Straight-Line Program model to encode the input 
polynomial P. In this model, the complexity of the whole lifting process is poly- 
nomial in the size of the output, in the number of variables n, and the number of 
operations L necessary to evaluate the polynomial P pg. Still, the whole algo- 
rithm requires the precomputation of the Grobner basis G, and the subroutine 
StopCriterion relies on normal form computations. This part dominates the 
whole cost of the algorithm. 



5 Algorithm 2: Iterated Study of Singular Loci 

The second approach generalizes the critical point methods used here to the case 
of polynomial systems. In the presence of infinitely many singular points, the 
infinitesimal deformation is avoided by studying the singular locus as a variety 
of lower dimension. 

Let V C C” be an equidimensional algebraic variety of dimension d and 
P = {Pi, . . . , Ps} polynomials in Q[xi, . . . , Xn] such that T(V) = (Pi, . . . , Pg). 
Following the notation of the two previous sections, given any point A in C", 
we define the polynomial system: 



S{P, A) = { Pi(M) = . . . = Pg(M) = 0, 

rank(gradM (Pi grad jvj(P^), AM) <n-d], 
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where the rank condition is expressed by setting to zero the (n—d+l) x (n—d+l) 
minors of the matrix [Jac(P), AM]. The roots of this system form a set denoted 
C(V,A). 

The algorithm proposed in is based on the following theorem: 

Theorem 2. The setC{V, A) meets every eonneeted eomponent o/VflM". More- 
over, there exists a Zariski-dense subset T of C" such that, for all A in T , 
C(V,A) can be written Sing(V) U Vq, where 

- Vo is a finite set of points in C", 

- Sing(V) = {M e V I rank(grad]y[(Pi), . . ,,grad^{Ps)) <n-d}. 

In particular, in this case, dim(C(V, A)) < dim(V). 

Suppose that V C C" and A G K" satisfy the conditions of Theorem 0 
and that Vi is a finite set of points that meets each connected component of 
Sing(V) nK". Theorem 0 implies that Vi U Vo meets each connected component 
of V n K" . The set Vi can in turn be obtained by applying Theorem 0 to each 
equidimensional component of Sing(V). The algorithm in ^ consists in applying 
inductively the above process, performing at each step equidimensional decom- 
positions of intermediate varieties C(Vi, A^). In the end, we obtain a family of 
zero-dimensional sets which meets each connected component of V nK". 

At each step, we need to apply a subroutine taking as input a polynomial 
system S and returning a set of generators of radical equidimensional ideals 
whose intersection is \fS. This can be done using the algorithms mentioned in 
pTTTj or by performing a decomposition into regular and separable triangular 
sets and computing a Grobner basis of the saturated ideal of each 

triangular set. We denote by EquiDimDecomposition such a radical equidimen- 
sional decomposition algorithm. 



Algorithm 2 

Input: A polynomial system P 

Output: At least one point on each connected component of V{P) 

1. list •<— EquiDimDecomposition(P), result ■<— [] 

2. while list [] do 

— P <r- first(list) and remove P from list, 

— if dim(P) = 0 then result <— result U P 

— else find by trial and error a point A such that dim(C(V(P), A)) < dim(P) 
and list <— list U EquiDimDecomposition(S'(P, A)) 

3. count and isolate the real roots of all the polynomial systems in result. 



6 Experimental Results 

6.1 Methodology and Basic Algorithms 

Both algorithms presented above have been implemented, using the following 
software tools: 
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— Gb/AGb: implemented in G++ by J.-G. Faugere 0 and devoted to Grobner 
basis computations; 

— RS: implemented in G by F. Rouillier, devoted to computing Rational Uni- 
variate Representations, and to counting and isolating real roots of univariate 
polynomials; 

— Kronecker: implemented in Magma by G. Lecerf devoted to computing 
Geometric Resolutions, from which we borrowed the formal Newton iterator. 

The subroutine EquiDimDecomposition was implemented using Maple and a file 
connection with Gb, following the algorithm described in ISH and based on the 
results in |SE^. 

All the computations were done on the computers of the UMS MEDIGIS | 2 |, 
on a PG Pentium II 400 MHz with 512 MB of RAM. 

6.2 Solution of the Problem 

The case n = 5, r = 4 of Birkhoff’s problem generates 53130 matrices, which 
produces as many hypersurfaces of to study. 

— 42925 of these hypersurfaces are defined by constant polynomials. 

— For the non-constant polynomials, to avoid unnecessary computations, we 
specialized all variables but one at random non-zero values and applied Us- 
pensky’s algorithm on the univariate polynomials we obtained, looking for 
non-zero real roots. At the end of this preprocessing, about one thousand 
hypersurfaces remained to study. 

— About 900 of these hypersurfaces had zero or a finite number of singularities. 
In all these cases, the first step given in section E~^ wa,s sufficient to conclude: 
the situation where C{P,A) had exclusively real roots with a coordinate 
equal to zero was never encountered. 

On a PG bi-Pentium 400 MHz with 512 MB of RAM, 4 hours are necessary to 
perform these 3 steps for all hypersurfaces. 

— There remained 102 hypersurfaces containing an infinity of singularities. We 
will see below that our implementation of Algorithm 1 can not solve all of 
them, whereas Algorithm 2 succeeded in all cases. In 60 out of these 102 
cases, we had to go through the second step given in section lO 

On the same machine, 2 additional hours are necessary to perform this final 
step with Algorithm 2. 

As a conclusion, 19092 out of the 53130 matrices are poised. Their complete 
list can be found in Maple format at the web page p. 

6.3 Comparing Algorithm 1 and Algorithm 2 

We give a more detailed account of the behavior of Algorithms 1 and 2 on a 
sample of the family of hypersurfaces with infinitely many singularities. These 
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hypersurfaces are denoted Birk.3-l,...,Birk3-15. All of them have degree less than 
8; the whole list can be found at the web page p. 

Tabled summarizes our results on this family. The sign oo indicates that the 
computations were stopped after 24 hours. 

— Algorithm 1: The first column gives the number of bounded roots we ob- 
tain, which is a measure of the size of the output. The second and third 
columns give the degrees in t and e of the generic resolution, which is a 
measure of the size of intermediate data. The last column gives the time 
necessary to perform all computations, in seconds. 

— Algorithm 2: The first column gives the sum of the degrees of the zero- 
dimensional systems produced in the course of the computations; the second 
indicates the total time of computations, given in seconds. 



Table 1. Algorithms 1/2: Size of the output and computation times 



Hypersurface 


Algorithm 1 


Algorithm 2 


Birk.3-1 


12 


16 


3 


5.6 


12 


0,08 


Birk.3-2 


7 


16 


3 


5.2 


7 


0,13 


Birk.3-3 


25 


34 


5 


32 


25 


0,37 


Birk.3-4 


16 


36 


5 


46 


16 


0,18 


Birk.3-5 


31 


40 


5 


116 


31 


0,46 


Birk.3-6 


37 


52 


7 


149 


37 


0,86 


Birk.3-7 


38 


52 


7 


115 


38 


0,72 


Birk.3-8 


45 


130 


19 


3927 


45 


7,11 


Birk.3-9 


47 


132 


19 


2945 


47 


7,88 


Birk.3-10 


48 


136 


31 


18843 


48 


8,04 


Birk.3-11 


50 


138 


31 


26536 


50 


8,88 


Birk.3-12 


50 


138 


31 


17508 


50 


10,01 


Birk.3-13 


32 


252 


29 


CX3 


32 


9,26 


Birk.3-14 


60 


264 


31 


OO 


60 


67 


Birk.3-15 


60 


272 


31 


OO 


60 


83 



For the last examples, the time spent in Algorithm 1 in the checking phase 
becomes largely predominant. Other strategies to certify this output, based on 
a sufficient number of sample checks, could lower this time. Even without this 
certification phase, the computation is longer than with Algorithm 2. Still, with 
regard to the degrees in t and e of the parametric resolution, we consider that 
our implementation of Algorithm 1 shows good performance. 

A relevant criterion to analyze the algorithms based on the critical point 
method is the degrees of the zero-dimensional systems they produce. For Algo- 
rithm 1, this is the cardinality of the set of bounded roots lime_,.o C(P — e. A). 
To this regard, the outputs of Algorithm 1 and Algorithm 2 are of similar size. 
The size of the intermediate data in Algorithm 1, such as the degree of the para- 
metric resolution, is bigger, as several points of C(P — e, A) collapse on a same 
point when e — > 0. 
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Nevertheless, the degrees of the output and of the intermediate bivariate 
polynomials in Algorithm 1 are bounded by d", while we have no similar bound 
for Algorithm 2. An open problem is to precise such a bound for Algorithm 2. 
In all these examples, the dimension of the singular locus was 1, so that there 
was at most one recursive call in Algorithm 2. Experiments with more intricate 
singular loci should tell us more about this question. 

7 Conclusions 

The case n = 5 and r = 4 of the Birkhoff Interpolation Problem is now automat- 
ically solved. The case n = 6 and r = 5 requires to study 1947792 hypersurfaces 
in C^; this combinatorial number is now the limiting factor. More research on 
qualitative nature should be devised to have a better control on this number; in 
this sense, the conclusions and suggestions in HH are still a topical question. 

This problem gave us the opportunity to compare two recent algorithms 
of computational real algebraic geometry and illustrate their practical use. It 
appears that the algorithms based on the critical point method can now solve 
application problems. 

In particular, we have implemented computations with an infinitesimal, con- 
sidering it as a parameter. Another approach consists in implementing an in- 
finitesimal arithmetic; we refer to I2ZI for such a realization in Axiom. Never- 
theless, obtaining good performance in practice using this type of arithmetic is 
still a computer science challenge. 

Besides, the use of infinitesimals in computational real algebraic geometry is 
not exclusive to the desingularization of hypersurfaces: they are required in sev- 
eral algorithms to decide the emptiness of semi-algebraic sets, such as pi9l9l30| . 
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A Practical Program of Automated Proving 
for a Class of Geometric Inequalities* 



Lu Yang^ and Ju Zhang^ 

^ Chengdu Institute of Computer Applications, Chinese Academy of Sciences 
2 GCTECH Info. Tech. Ltd., Beijing 100013, China 



Abstract. An inequality-proving algorithm based on cell decomposition 
and a practical program written in Maple are presented, which can ef- 
ficiently treat inequality-type theorems involving radicals, especially, a 
class of geometric inequalities including most of the theorems in a well- 
known book on the subject. 



1 Introduction 

In the last 20 years, the efficiency of automated theorem proving for equality- 
type theorems has increased greatly leaving behind inequality-type theorems, 
especially, for geometric theorems. This work is an effort to shorten the distance. 

Automated theorem proving on inequalities is always considered a difficult 
topic in the area of automated reasoning. Relevant algorithms depend funda- 
mentally on real algebra and real geometry, and the computational complexity 
increases very quickly with the dimension, that is, the number of parameters. 
Besides the memory saving, the speed improvement in theorem proving some- 
times is also of importance. When a problem requires verification of a batch 
of non-trivial propositions, an inefficient algorithm cannot handle it within the 
time allowed by human patience. For recent progress made in this aspect, see 

mmisrmm- 

When the hypotheses contain polynomial equations, one may think about 
eliminating some variables to make the dimension lower. However, we usually 
have to deal with irrational algebraic functions, such as parametric radicals. 

Example la. Given real numbers a;, y, z, ui, U2, U3, U4, M5, ug satisfying the fol- 
lowing 15 conditions 

' [xy -i-yz + xzyuf — x^{y + z){xy xz ^yz) = 0, 

(xy -\-yz + xz)'^U 2 — y^{x + z)[xy -\-yz 4xz) = 0, 

(xy -hyz + xz)^u^ — z^(x + y)(yz -h xz -h 4xy) = 0, 

(x 4- y 4- z)(u\ — x"^) — xyz = 0, 

(x + y + z)(ul - - xyz = 0, 

{x + y + z)(ug — z^} — xyz = 0, 

X > 0, y > 0, z > 0, 

. Ml > 0, U2 > 0, U3 >0, Ui > 0, Us > 0, Us > 0, 

prove that ui + U2 + U3 < U4 + + uq. 

* The work is supported in part by NKBRSF-(G 1998030602). Lu Yang is concurrently 
at Guangzhou University, Guangzhou 510405, China. 



J. Richter-Gebert and D. Wang (Eds.): ADG 2000, LNAI 2061, pp. 2001. 

(c) Springer- Verlag Berlin Heidelberg 2001 



42 



Lu Yang and Ju Zhang 



Eliminating ... from (1) by solving the 6 equations, we convert the 
proposition to the following inequality which appeared as a conjecture in refer- 
ence [Tn| . 



Example 1. Show that 

^Jx^{y + z) {xy + xz + 4: yz) \/y^{x + z){xy + yz + Axz) 



xy + yz + xz 

^/z^{x + y){yz + xz + A xy) 
xy + yz + xz 



xy + yz + xz 



< 



xyz 

X + y + z 



xyz 

X + y + z 



xyz 

X + y + z 



( 2 ) 



where a: > 0, j/ > 0, 2 ; > 0. 

This includes 3 variables but 6 radicals, while (1) includes 9 variables. 



A dimension-decreasing algorithm introduced by the authors can efficiently 
treat parametric radicals and maximize reduction of the dimensions. Based on 
this algorithm, a generic program called “BOTTEMA” was implemented on a 
PC computer. More than 1000 algebraic and geometric inequalities including 
hundreds of open problems have been verified in this way. The total CPU time 
spent for proving 100 basic inequalitiet0 from Bottema et al.’s monograph 
Geometric Inequalities^’’ on a Pentium III/550 was 10-odd seconds only. It can 
be seen later that the inequality class, to which our algorithm is applicable, is 
very inclusive. 

The paper is organized as follows. Section 2: illustrate fundamental notions 
with examples; Section 3: sketch an inequality-proving algorithm which can treat 
radicals efficiently; Section 4: introduce a transformation of variable which helps 
reduce the degrees of some polynomials concerned in proving a class of inequali- 
ties on triangles, that class contains most of the inequalities in book PJ ; Section 
5: commands and syntax for running the program BOTTEMA; Section 6: show 
the performance of the program with a series of examples; Section 7: conclusion. 



2 Fundamental Definitions 

Before we sketch the so-called dimension-decreasing algorithm, some definitions 
should be introduced and illustrated. 

Definition 1. Assume that l{x, y,z, . . .) and r{x, y,z,. . .) are continuous alge- 
braic functions of x, y, z, . . .. We call 

l{x,y,z,...) <r{x,y,z,...) or l{x,y, z, . . .) < r{x,y, z, . . .) 



^ which include some classical results such as Euler’s Inequality, Finsler-Hadwiger’s 
Inequality, and Gerretsen’s Inequality. 
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an algebraic inequality m.x,y,z . . and l{x, y,z, . . .) = r{x, y,z , . . .) an algebraic 
equality in x, y, z, . . .. 

Definition 2 . Assume that ^ is an algebraic inequality (or equality) in x, y, z, 
. . . . L(T) is called a left polynomial of provided that 

— L(T) is a polynomial in T, its coefficients are polynomials in x,y, z, . . . with 
rational coefficients; 

— the left-hand side of ^ is a zero of L{T). 

The following item is unnecessary for this definition, but it helps to reduce the 
computational complexity in the process later. 

— Amongst all the polynomials satisfying the two items above, L{T) is what 
has the lowest degree in T. 

According to this definition, L{T) = T if the left-hand side is 0, a zero polyno- 
mial. The right polynomial of namely, R(T), can be defined analogously. 

Definition 3. Assume that ^ is an algebraic inequality (or equality) in x,y,. . . 
etc., L{T) and R{T) are the left and right polynomials of respectively. By 
P{x,y , . . .) denote the resultant of L{T) and R{T) with respect to T, and call 
it the border polynomial of and the surface defined by P(x, y, . . .) = 0 the 
border surface of respectively. 

The notions of left and right polynomials are needed in practice for computing 
the border surface more efficiently. In Example 1, we set 

fi = (xy + yz + xz)'^Ui — x^{y + z){xy + xz + 4 yz), 

/2 = {xy + yz + xz)'^U 2 — y^{x + z){xy + yz + 4xz), 

fs = {xy + yz + xz)'^ul - z^{x + y){yz + xz + 4xy), 

f 4 = {x + y + z){ul - x^) - xyz, 

f 5 = {x + y + z){ul - y^) - xyz, 

fe = {x + y + z){ul - z'^) - xyz, 

then the left and right polynomials of ( 2 ) can be found by successive resultant 
computation: 

resultant(resultant(resultant(ui + U 2 + — T, /i, Ui), / 2 , U 2 ), /s, U 3 ), 

resultant(resultant(resultant(u 4 + U 5 + uq — T, f^, U 4 ), f^, U 5 ), fe, ug). 

Removing the factors which do not involve T, we have 

L{T) = {xy + X z + y z)^ — 4(x"^ y'^ + 2 x'^ y z + x^ z"^ + 4x^ y"^ z + 4x^ y z'^ 

+ x"^ y“^ + 4x'^ y^ z + Ax"^ y z^ + x"^ z“^ + 2 xy'^ z + 4xy^ z"^ + Axy"^ z^ 

+ 2 X y z* + y^ z"^ + y'^ z'^){x y + X z + y z)^ + ■ ■ ■ , 
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R{T) = {x + y + T® — 4(x^ + x‘^y + x'^z + xy'^ + 5xyz + xz'^ + y^ + y‘^z 
+ y z^ + z^) {x + y + z)^ + 2{16 xy z* + 14:xy^ z^ + 14:xy^ z^ + 16 xy'^ z 

+ lAx"^ y z^ + 14x'^ y^ z + 14x^ y z^ + 14x^ y^ z + 16x‘’' y z + 3x^ + 5 x* y^ 

+ 5x^ 2^ + + 5i/^ + 21 y^ z"^ + 3y^ + 3 z^ 

+ 6x^y + 6x^ z + 4x^y^ + 4x^ z^ + 6xy^ + 6xz^ + 6y^ z + 4y^ z^ + 6yz^) 
{x + y + z)^ 

— 4(x + y + z) (x® — X* y^ — x^ z^ + 2 x^ y^ z + 2 x^ y z^ — x^ y* + 2 x^ y^ z 

+ 7x^ y^ z^ + 2x^ yz^ - x^ z'^ + 2xy^ z^ + 2xy^ z^ + y^ - y'^ z^ - y^ z'^ + z®) 
(x® + 3a;^?/ + 3x^z + 3x2/^ + 7a;j/z + 3a;z^ + y® + 3y^z + 3yz^ + z®)r^ 

+ (-6 a; z® - 6 a; y® z^ - 6 y z® - 6 y® z - 6 a;® y z^ - 6 a;® z + X® 

— X y — X z — X y — x z — y z — y z —9x y z -\- y -\- z -\- 2x y 

+ 2x® z - 4x®y® - 4x® z® + 2xy® + 2xz® + 2y® z - 4y® z® + 2yz®)^. 



The successive resultant computation for L{T) and R{T) spent CPU time 0.51s 
and 0.08s, respectively, on a Pentium 111/550 with Maple V.5.1. And then. It 
took us 111.21s to obtain the border polynomial of degree 100 with 2691 terms. 

We may of course reform (2) to the equivalent one by transposition of terms, 

e.g. 



\/x^{y + z){xy + xz + 4yz) ^ \/y^{x + z) (xy + yz + 4 xz) 
xy + yz + xz xy + yz + xz 

\/z^ (x + y) (yz + xz + 4 xy) 



< 



xy + yz + xz 
_l_ xyz 
X + y + z 



- X X 



xyz 

X + y + z 



xyz 

X + y + z 



( 3 ) 



However, the left polynomial of (3) cannot be found on the same computer (with 
memory 256 Mb) by a Maple procedure as we did for (2), 
f : =ul+u2+u3-u4-u5-T ; 

for i to 5 do f : =resultant (f ,f . i ,u. i) od; 
after running 1066.52s, the screen shows “Error, object too large” at last. 

One might try to compute the border polynomial directly without employing 
left and right polynomials, that is, using the procedure 
f ; =ul+u2+u3-u4-u5-u6 ; 

for i to 6 do f : =resultant (f ,f . i ,u. i) od; 
but the result is not better. After running 1453.67s, the screen shows “Error, 
object too large” once again. 

Example 2. Given an algebraic inequality in x,y, z. 



nia + mi, + me <2s 



( 4 ) 
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where 

TOa = ^ a/2 (x + yy + 2(x + z)^ ~(y + z)^, 

mb = i \/2 (y + z)^ + 2(x + y)^ - (x + z)^, 

me = ^ \/2 (x + z)^ + 2 (y + z)^ - (x + y)^, 

s = X + y + z 

with a; > 0, y > 0, z > 0, compute the left, right and border polynomials. 
Let 



fi = iml + {y + z^ -2{x + y)'^ -2{x + z)'^, 
f 2 =4ml + {x + z)'^ -2{y + z)'^ -2{x + yy, 
fs = 4ml + {x + y)'^ - 2 {x + zf - 2 (y + zf 

and do successive resultant computation 

resultant (resultant (resultant (m^j + mt, + rric — T, /i, mo), / 2 , mt,), /a, rric), 

we obtain a left polynomial of (4): 

T® — 6 (x'^ + y^ + z'^ + xy + yz + zx)T^ + 9{x^ + 2 x y z'^ + y'^ + 2 x z^ 

+ 2x^y + z'^ + 'iy'^z'^ + 2y‘^zx + 2y^z + 2yz^ + ix'^z‘^ + 2x^z + 2x'^yz 

+ 2xy^ + 5x‘^ y‘^)T^ - (72 x^yz + 78 x^yz"^ + 4x^ + 4y^ +4 z^ + 12 xy^ 

-8x^ 1/2 _ 3^2 _3^2 yi _ 3^4 z* - z^ -26x^ y^ - 26cc® z® 

- 26 y® z^ + 12x z^ + I2y^ z + I2y z^ + 12x^ z + 12x^ y + SAx"^ y^ 2 ^ 

+ 72xyz'^ + 72xy'^z + 78xy^z‘^ + 78xy'^z^ + 78x'^yz^ + 78 x® 

+ 78 x2 y® z)T'^ + 81 x2 y2 z2 (x + y + z)^. (5) 

It is trivial to find a right polynomial for this inequality because the right-hand 
side contains no radicals. We simply take 

T-2(x + y + z). (6) 

Computing the resultant of (5) and (6) with respect to T, we have 

(144 x^y+ 144 x^ z + 780 x^ y^ + 1056 x^ y z -k 780 x^ ^288 x® y® 

-k 3048 X® y2 z -k 3048 x® y z^ + 1288 x® z® -k 780 x^ y® + 3048 x^ y® z 
-k 5073 x2 y2 z2 + 3048 x^ y z® -k 780 x^ z® -k 144 x y® -k 1056 x y® z 
-k 3048 X y® z2 + 3048 x y^ z® -k 1056 x y z® -k 144 x z® -k 144 y® z -k 780 y® z^ 

-k 1288 y® z® -k780y2 z"^ + lUy z^)(x + y + zf . 
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Removing the non- vanishing factor {x + y + z)"^ , we obtain the border surface 

144 x^y + 144 x^ z + 780 -h 1056 x^ y z + 780 x^ z^ + 1288 x^ y^ 

+ 3048 x^y"^z + 3048 x^ y z"^ + 1288 x^ -h 780 x^ y^ + 3048 x^ y^ z 

+ 5073 x^ -h 3048 x^ y z^ + 780 x^ z'^ -h 144 x y® -h 1056 x y^ z 

-h 3048 X y^ z^ -h 3048 x y^ z^ -h 1056 x y z'^ -h 144 x z^ + 144 y® z -h 780 y^ z^ 

-h 1288 y3 z^-h 780 y^z^-h 144 yz® = 0. (7) 

3 A Sketch to Dimension-Decreasing Algorithm 

In the present paper, we deal with a class of propositions which take the following 
form (though the algorithm is applicable to a more extensive class): 



A A • • • A ^ ^0, 



where <1>q, <l>i, . . . , <l>s are algebraic inequalities in x, y, z, . . . etc., the hypothesis 

A A - • • A defines either an open selHor an open set with the whole/partial 
boundary. 

Example 1 may be written as (x > 0) A (y > 0) A (z > 0) => (2), where the 
hypothesis (x > 0) A (y > 0) A (z > 0) defines an open set in the parametric 
space R^, so it belongs to the class we described, so does Example 2. This class 
covers most of inequalities in Bottema et al.’s book P and Mitrinovic et al.’s 0 
“Recent Advances in Geometric Inequalities^’’ . In fact. Example 2 is a geometric 
inequality encoded in x, y, z, see p. 

We take the following procedures when the conclusion I>q is of type <. (As 
for I>Q of type <, what we need do in additional is to verify if the equation 
Iq{x, y, . . .) — ro(x, y, . . .) =0 has no real solutions under the hypothesis, where 
lo{x,y , . . .) and ro(x,y , . . .) denote the left- and right-hand sides of <Po, respec- 
tively.) 

— Find the border surfaces of the inequalities <Po, , ^s- 

— These border surfaces decompose the parametric space into a finite number of 
cells. Among them we just take all the connected open sets, Di^D 2 , . . . , Dk, 
and discard the lower dimensional cells. Choose at least one test point in 
every connected open set, say, {x^, y ^, . . .) G D^, z/ = 0, 1, . . . , fc. This step 
can be done by an incomplete cylindrical algebraic decomposition which 
is much easier than the complete one since all the lower dimensional cells 
were discarded. Furthermore, we can make every test point a rational point 
because it is chosen in an open set. 

— We need only check the proposition for such a finite number of test points, 
(xi, yi, . . .), . . ., (xfc, y/c, . . .). The statement is true if and only if it holds over 
these test values. 

^ may be disconnected. 
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The proof of the correctness of the method is sketched as follows. 

By . . .), r^{x,y, . . .) and Pf^{x,y , . . .) = 0 denote the left-, right-hand 

sides and border surface of respectively, and 

Sf,{x,y,...) =U^{x,y,...) -r^(a;,y, ...), 



for /i = 0, . . . , s. 

The set of real zeros of all the y, . . .) is a closed set, so its complementary 
set, say A, is an open set. On other hand, the set 

D=^DiU---[JDk 

is exactly the complementary set of real zeros of all the Pfi{x, y,- ■ ■)■ 

We have D C A since any zero of 6ft{x, y, ■ ■ ■) must be a zero of P^{x, y , . . .). 
By Z\i, . . . , Z\i denote all the connected components of A, so each one is a con- 
nected open set. Every A^ must contain a point of D for an open set cannot be 
filled with the real zeros of all the P^{x, y , . . Assume that A\ contains a point 
of Di, some connected component of D. Then, Di C A\ because it is impossible 
that two different components of A both intersect Di. By step 2, Di contains 
a test point {xi,yi , . . .). So, every A\ contains at least one test point obtained 
from step 2. 

Thus, 6^{x,y,...) keeps the same sign over A\ as that of 6^{xi^,yi^, . . .) 
where {xi^ ^yix ■ ■ •) is a test point in A\, for A = 1, . . . , t; /i = 0, . . . , s. Otherwise, 
if there is some point (a;', y', . . .) G Z\a that 5^{x' , y ' , . . .) has the opposite sign to 
,yi^, . . .), connecting two points (x', y ', . . .) and (xi^ ,yi^,...) with a path P 
such that r C A\, then there is a point (x, y, . . .) G P such that Sfi(x, y, . . .) = 0, 
a contradiction! 

By AU P denote the set defined by the hypothesis, where A is an open set 
defined by 

(<5i(x, y, . . .) < 0) A • • • A (5s(x, y, . . .) < 0), 

that consists of a number of connected components of A and some real zeros 
of <5o(x, y, . . .), namely A = Q U S' where Q = Z\i U • • • U and S is a set of 
some real zeros of So{x,y , . . .). And B is the whole or partial boundary of A, 
that consists of some real zeros of <5^(x, y, . . .) for y = 1, . . . , s. 

Now, let us verify whether i5o < 0 holds for all the test points in A, one by one. 
If there is a test point whereat Sq > 0, then the proposition is false. Otherwise, 
i5o < 0 holds over Q because every connected component of Q contains a test 
point and Sq keeps the same sign over each component A\, hence (5 q < 0 holds 
over A by continuity, so it also holds over A U i?, i.e., the proposition is true. 

The above procedures sometimes may be simplified. When the conclusion 
belongs to an inequality class called “class CGR”, what we need do in step 3 is 
to compare the greatest roots of left and right polynomials of over the test 
values. 
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Definition 4. An algebraic inequality is said to belong to class CGR if its 
left-hand side is the greatest (real) root of the left polynomial L{T), and the 
right-hand side is that of the right polynomial R{T). 

It is obvious in Example 1 that the left- and right-hand sides of the inequality 
(2) are the greatest roots of L{T) and R{T), respectively, because all the radicals 
have got positive signs. Thus, the inequality belongs to class CGR. What we 
need do is to verify whether the greatest root of L{T) is less than or equal to 
that of R{T), that is much easier than determine which is greater between two 
complicated radicals, in the sense of accurate computation. 

If an inequality involves only mono-layer radicals, then it always can be 
transformed into an equivalent one which belongs to class CGR by transposition 
of terms. Actually, most of inequalities in and |0|, including the examples in 
the present paper, belong to class CGR. For some more material, see HS|. 

4 Inequalities on Triangles 

An absolute majority of the hundreds inequalities discussed in ^re on trian- 
gles, so are the thousands appeared in various publications since then. 

For geometric inequalities on a single triangle, usually the geometric invari- 
ants are used as global variables instead of Cartesian coordinates. By a, b, c 
denote the side-lengths, s the half perimeter, i.e. ^{a + b + c), and x, ?/, z denote 
s — a,s — b,s — c, respectively, as people used to do. In additional, by A, B, C the 
interior angles, S the area, R the circumradius, r the inradius, ra,rb, the radii 
of escribed circles, ha,hf,,hc the altitudes, ma^rnt^mc the lengths of medians, 
Wa, Wb, Wc the lengths of interior angular bisectors, and so on. 

People used to choose x, y, z as independent variables and others dependent. 
Sometimes, another choice is better for decreasing the degrees of polynomials 
occurred in the process. 

An algebraic inequality y, z) can be regarded as a geometric inequality 
on a triangle if 

— X > 0, ?/ > 0, z > 0; 

— the left- and right-hand sides of <P, namely l{x,y,z) and r{x,y,z), both are 
homogeneous; 

— l{x,y,z) and r{x,y,z) have the same degree. 

The item 1 means that the sum of two edges of a triangle is greater than the third 
edge. The items 2 and 3 means that a similar transformation does not change 
the truth of the proposition. For example, (4) is such an inequality for its left- 
and right-hand sides, TOq + rnf, + rric and 2 s, both are homogeneous functions 
(of X, y, z) with degree 1. 

In addition, assume that the left- and right-hand sides of <?(x, y, z), namely, 
l{x,y,z) and r(x,y,z), both are symmetric functions of x, y, z. It does not 
change the truth of the proposition to replace x, y, z in l{x,y,z) and r{x,y,z) 
with x',y',z' where x' = px, y' = py, z' = pz and p > 0. 
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Clearly, the left and right polynomials of y' , z'), namely, A(T, x', t/', z') 
and R{T, x', y' , z'), both are symmetric with respect to x', y' , 2 ', so they can be 
re-coded in the elementary symmetric functions of x' ,y' , z' , say. 



Hi{T, CTi,(T 2 ,cr 3 ) = L{T, x',y',z'), Hr{T, ai,a 2 ,aa) = R(T, x',y',z'), 
where ai = x' + y' + z', 172 = x'y' + y'z' + z'x' , = x'y' z' . 

Setting p = ^ we have x'y' z' = x' + y' + z' , i.e., 0-3 = CTi. Further, 

letting 

S = Cri(=CT3), P=(T2-9, 

we can transform L{T, x', y' , z') and R{T, x', y', z') into polynomials in T, p, s, 
say, F{T,p, s) and G{T,p, s). Especially if F and G both have only even-degree 
terms in s, then they can be transformed into polynomials in T, p and q where 
q = — Ap — 27. Usually the degrees and the numbers of terms of the latter 

are much less than those of L{T,x,y, z) and R{T,x,y, z). We thus construct 
the border surface which is encoded in p, s or p, q, and do the decomposition 
described in last section on (p, s)-plane or (p, q)-plane instead of R^. This may 
reduce the computational complexity considerably for a large class of geometric 
inequalities. The following example is also taken from 

Example 3. By Wa, Wb, Wc and s denote the interior angular bisectors and half 
the perimeter of a triangle, respectively. Prove 

WbWc + WcWa + WaWb < . 



It is well-known that 

2 ^Jx{x + y){x + z){x + y + z) 

2x + y + z 

2 ^/y {x + y)(y + z){x + y + z) 

2y + x + z 

2 \/ z {x + z){y + z){x + y 

2 z + X + y ’ 

and s = X -I- p -I- z. By successive resultant computation as above, we get a left 
polynomial which is of degree 20 and has 557 terms, while the right polynomial 
T — (x-|-p-|-z)^ is very simple, and the border polynomial P(x, p, z) is of degree 
15 and has 136 terms. By this routine, the whole proving process take us 2697 
sec on a Pentium III/550 computer. 

However, if we encode the left and right polynomials in p, q, we get 
(9p-k2p-h64)^T^-32 

(4p -h g -k 27) (p -k 8) (4p2 + pg + 69p-kl0g-k 288) (9p -k 2 g -k 64)^ 

- 512 (4p -k g -k 27f (p -k Sf (9p -k 2 g -k 64)^ T + 256(4p -k g -k 2lf 
(p -k 8)2 (-1024 - 64p -k 39p2 - 128 g - 12p g - 4 g2 -k 4p3 -k p2 g) 



Wa = 
Wb = 
Wc = 
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and T — 4p — q — 27, respectively, hence the border polynomial 

Q{p, q) = 5600256/ g + 50331648p + 33554432g + 5532160/ 

+ 27246592/ + 3604480 / + 22872064pg + 499291/ + 16900/ 

+ 2480 / + 16 / + 143360 / + 1628160 p / + 22945 p^ q 
+ 591704 p3 q + 11944 / + 2968 p^ / + 242568 p^ / + 41312p/ 

+ 352p/ 

which is of degree 5 and has 20 terms only. The whole proving process in this 
way spend about 0.10s on the same machine. 

5 Commands and Syntax 

As a prover, the whole program is written in Maple V.5.1 including the cell 
decomposition, without external packages employed. 

On verifying an inequality with BOTTEMA, we need only type in a proving 
command, then the machine will do everything else. If the statement is true, 
the computer screen will show “The inequality holds'"', otherwise, it will show 
“The inequality does not hold" with a counter-example. There are three kinds 
of proving commands: prove, xprove and yprove. 

prove - prove a geometric inequality on a triangle, or an equivalent algebraic 
inequality. 

Calling Sequence: 

prove(ineq); 
prove(ineq, ineqs); 

Parameters: 

ineq - an inequality to be proven, which is encoded in the geometric invariants 
listed later. 

ineqs - a list of inequalities as the hypothesis, which is encoded as well in 
the geometric invariants listed later. 

Description: 

— The command ‘prove’ is valid to a geometric inequality on a triangle that 
must be of type ‘<’ or ‘>’, with a set of inequalities ‘ineqs’ as the hypothesis 
which defines either an open set or an open set with the whole/partial bound- 
ary; and ‘ineq’ and ‘ineqs’, must be represented by the rational functions and 
radicals in the geometric invariants listed below. 

— The command ‘prove’ also valid to a statement whose hypothesis and thesis, 
‘ineqs’ and ‘ineq’, all are homogeneous algebraic inequalities represented by 
the rational functions and radicals in x,y,z provided cc > 0, y > 0, z > 0, 
and of the required types as above. This is equivalent to a geometric one, as 
shown in last section. 
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— The following list of geometric invariants is extendable. 

Geometric Invariants on a Triangle: (extendable) 



a, b, c, 


lengths of sides of a triangle ABC 


s, 


s:=(a+b+c)/2, half the perimeter 


x,y,z, 


x:=s-a, y:=s-b, z:=s-c 


s, 


Area of the triangle 


R, 


circumradius 




inradius 


ra, rb, rc, 


radii of escribed circles 


ha, hb, he, 


altitudes 


ma, mb, me. 


medians 


wa, wb, we. 


interior-angle-bisectors 


P, 


p;=4=Kr*(R-2*r) 


q, 


q:=s~2-16*R*r+5*r"2 


HA,HB,HC, 


distances from orthocenter to vertices 


lA, IB, IC, 


distances from incenter to vertices 


sin(A), sin(B), sin(C), 


sines of the interior auigles 


eos(A), eos(B), eos(C), 


cosines of the interior angles 


tan(A), tcm(B), tcOi(C), 


tangents of the interior angles 


eot(A), eot(B), eot(C), 


cotangents of the interior angles 


see(A), see(B), sec(C), 


secants of the interior angles 


ese(A), esc(B), esc(C), 


cosecants of the interior auigles 


sin(A/2), sin(B/2), sin 


(C/2), 


eos(A/2), eos(B/2), eos 


(C/2), 


tan(A/2), tan(B/2), tcm(C/2), 


eot(A/2), cot(B/2), eot(C/2), 


see(A/2), see(B/2), see 


(C/2), 


ese(A/2), ese(B/2), esc 


(C/2), 



Examples: 

> read bottema; 

> prove (a~2+b~2+c~2>=4*sqrt (3) *S+(b-c) ~2+(c-a) ~2+(a-b) ~2) ; 

The theorem holds 

> prove(cos(A)>=cos(B) , [a<=b] ) ; 

The theorem holds 

xprove - prove an algebraic inequality with positive variables. 

Calling Sequence: 

xprove(ineq); 
xprove(ineq, ineqs); 
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Parameters: 

ineq - an algebraic inequality to be proven, with positive variables, 
ineqs - a list of algebraic inequalities as the hypothesis, with positive vari- 
ables. 

Description: 

— The command ‘xprove’ is valid to an algebraic inequality ‘ineq’ which must 
be of type ‘<’ or with a set of inequalities ‘ineqs’ as the hypothesis which 
defines either an open set or an open set with the whole/partial boundary. 

— All the hypothesis and thesis must be represented by the rational functions 
and radicals. 

— All the variables appear in ‘ineq’ are supposed always positive that conditions 
need not be explicitly included. 

Examples: 

> read bottema; 

> xprove (sqrt (u~2+v~2)+sqrt ( (1-u) ~2+(l-v) ~2) >=sqrt (2) , 

[u<=l,v<=l] ) ; 

The theorem holds 

> f :=(x+l) ~(l/3)+sqrt(y-l)+x*y+l/x+l/y'2: 

> xprove (£>=42496/10000, [y>l] ) ; 

The theorem holds 

> xprove (£>=42497/10000, [y>l] ) ; 

with a counter example 

\_29 _ 294117648 ' 

294117647 

The theorem does not hold 
yprove - prove an algebraic inequality in general. 

Calling Sequence: 

yprove(ineq); 
yprove(ineq, ineqs); 

Parameters: 

ineq - an algebraic inequality to be proven. 

ineqs - a list of algebraic inequalities as the hypothesis. 

Description: 

— The command ‘yprove’ is valid to an algebraic inequality ‘ineq’ which must 
be of type ‘<’ or ‘>’, with a set of inequalities ‘ineqs’ as the hypothesis which 
defines either an open set or an open set with the whole/partial boundary. 
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— All the hypothesis and thesis must be represented by the rational functions 
and radicals. 

Examples: 

> read bottema; 

> f : =x"6*y”6+6*x~6*y”5-6*x~5*y"6+15*x~6*y~4-36*x~5*y~5+15*x"4*y"6 

+20*x"6*y~3-90*x"5*y"4+90*x~4*y~5-20*x~3*y~6+15*x"6*y"2 

-120*x"5*y"3+225*x~4*y~4-120*x~3*y"5+15*x~2*y"6+6*x~6*y 

-90*x"5*y~2+300*x~4*y'3-300*x"3*y~4+90*x'2*y"5-6*x*y~6+x~6 

-36*x"5*y+225*x"4*y"2-400*x~3*y~3+225*x"2*y~4-36*x*y~5+y~6 

-6*x~5+90*x~4*y-300*x~3*y"2+300*x~2*y~3-90*x*y~4+6*y~5+15*x~4 

-120*x"3*y+225*x"2*y"2-120*x*y~3+15*y~4-20*x"3+90*x~2*y 

-90*x*y~2+20*y~3+16*x~2-36*x*y+16*y"2-6*x+6*y+l : 

> yprove (f >=0) ; 



The theorem holds 



6 More Examples 

The well-known Janous’ inequality 0 which was proposed as an open problem 
in 1986 and solved in 1988. 

Example 4. By rUa, ms, rric and 2 s denote the three medians and perimeter of 
a triangle, show that 

1115 

1 1 ^ — . 

rua rrib rric s 

The left-hand side of the some difficult inequality implicitly contains three 
radicals. BOTTEMA automatically interprets the geometric proposition to al- 
gebraic one before proves it. The total CPU time spent for this example on a 
Pentium III/550 is 29.22s. 

The next example was proposed as an open problem, E. 3146*, in the Amer. 
Math. Monthly 93:(1986), 299. 

Example 5. By a, b, c and s denote the side-lengths and half perimeter of a 
triangle, respectively. Prove or disprove 

2 s(\/s — a + V s — b + \/s — c) < 3 (\/6c(s — a) + \J ca(s — b) + \J ab{s — c)). 

The proof took us 48.60s on the same machine. 

The following open problem appeared as Problem 169 in Mathematical Com- 
munications (in Chinese). 

Example 6. By ra,rb,rc and Wa,Wb,Wc denote the radii of the escribed circles 
and the interior angle bisectors of a triangle, respectively. Prove or disprove 

, 1, 

^TaThTc < ■^{Wa + Wb + Wc). 
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In other words, the geometric average of ra,rb,rc is less than or equal to the 
arithmetic average of Wa,Wh,Wc- 

The right-hand side of the inequality implicitly contains 3 radicals. BOT- 
TEMA proved this conjecture on a Pentium III/550 with CPU time 96.60s. One 
more conjecture proposed by J. Liu m was proven on the same machine with 
CPU time 375.14s. That is: 

Example 7. By a, b, c, rria, mf,, me and Wa, Wb, Wc denote the side lengths, 
medians and interior-angle-bisectors of a triangle, respectively. Prove or disprove 



The following conjecture was first proposed by J. Garfunkel at Crux Math. 
in 1985, then re-proposed twice again in and p]. 

Example 8. By A, B, C denote the three angles of a triangle. Prove or disprove 



It was proven on a Pentium III/550 with CPU time 163.54s. 

A. Oppenheim studied the following inequality [3| in order to answer a prob- 
lem proposed by P. Erdos. 

Example 9. Let a, b, c and TOq, mb, m^ denote the side lengths and medians of 
a triangle, respectively. If c = min{a, b, c}, then 



The hypothesis includes one more condition, c = min{a, b, c}, so we type in 

prove(2*ma+2*mb+2*mc<=2*a+2*b+(3*sqrt(3)-4)*c, [c<=a,c<=b]) ; 

This took us 547.65s on the same machine. If we type in 

prove (2=t=ma+2*mb+2*mc<=2*a+2*b+ (3*sqrt (3) -4) *c) ; 

without the additional condition, the screen will show “T/ie inequality does not 
hold'' with a counter-example, [a = 203, b = 706, c = 505]. 

A problem of positive semi-definite decision is originated from one of the 
conjectures proposed by B. Q. Liu |7|: 

Example 10. Assume that x > 0, y > 0, z > 0. Prove 



2187(?/V(?/ + zf{2x + y + zf + + z)^(x + 2y + z)® -b 

x‘^y‘^(x + y)'^(x + y + 2 z)®) — 256(a; + y + z)^(x + y)‘^{x + z)'^{y + z)^ > 0. 







COS 



B-C C-A A-B 
__+cos^-+cos^-< 




2 rUa + 2 mb + 2 me < 2a + 26-f (3^3-4) c. 
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The polynomial after being expanded is of 201 terms with the largest coefficient 
(absolute value) 181394432. Usually it is non-trivial to decide a polynomial to be 
positive semi-definite or not, but this one took us CPU time 2.23s only, because 
of the homogeneity and symmetry which can help decrease the dimension and 
degree concerned. 

There are two well-known geometric inequalities. One is the so-called “Euler’s 
Inequality”, R>2r, another is rria > Wa- They are often cited in illustration of 
various algorithms I2H1I12I for inequality proving. The following example makes 
a comparison between the two differences, R — 2r and ma — Wa- 



Example 11. By R, r denote the circumradius and inradius of a triangle, and 
TOa, Wa the median and the interior angle bisector on a certain side; prove 



rua - Wa < R - 2r. 



It took us 15. 73s. 

The geometric inequalities which can be verified by the program, of course, 
are not limited to those on triangles. To prove the so-called “Ptolemy Inequality” , 
we will use Cartesian coordinates instead of geometric invariants. 

Example 12. Given four points A,B,C,D on a plane, by AB,AC,AD,BC, 
BD, CD denote the distances between the points, respectively. Prove 

AB ■ CD + BC ■ AD > AC ■ BD. (8) 

Put A = (— i, 0), B = (x, y), C = (i, 0), H = (m, v), and convert (8) to 

- 2;)^ + 2/^ - uY + v'^ + sj {x - + y'^ sj (-i - uY + v'^ 

> \/{x - uY + {y - vY- (9) 

We need just type in “yprove ("/.)” where 7. stands for inequality (9). The screen 
shows “The inequality holds” after running 2.70s. 

According to our record, the CPU time spent (on a Pentium III/550) and 
the numbers of the test points for above examples are listed as follows. 



Example 


1 


702.15s 


23 test points 


Example 


2 


0.18s 


1 test point 


Example 


3 


0.10s 


1 test point 


Example 


4 


29.22s 


12 test points 


Example 


5 


48.60s 


135 test points 


Example 


6 


96.60s 


4 test points 


Example 


7 


375.14s 


3 test points 


Example 


8 


163.54s 


121 test points 


Example 


9 


547.65s 


287 test points 


Example 


10 


2.23s 


2 test points 


Example 


11 


15.73s 


22 test points 


Example 


12 


2.70s 


48 test points 
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The time listed above includes that spent for everything, finding the left, right 
and border polynomial, cell decomposition, one-by-one sample point test, etc. 

7 Conclusion 

— This program is applicable to any inequality-type theorem whose hypothesis 
and thesis all are inequalities in rational functions or radicals, but the thesis 
is of type “<” or “>”, and the hypothesis defines either an open set or an 
open set with the whole/partial boundary. 

— It is beyond the capacity of this prover to deal with the algebraic functions 
other than the rational ones and radicals. 

— It runs in a completely automatic mode, without human intervention. 

— It is especially efficient for geometric inequalities on triangles. The input, in 
this case, is encoded in geometric invariants. 

We will make the package available publicly as soon as possible. The inter- 
ested reader may contact us by e-mail luyang@guangztc.edu.cn or cdluyang@ 
mail.sc.cninfo.net for further information. 
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Abstract. We develop a probabilistic test for the vanishing of radical 
expressions, that is, expressions involving the four rational operations 
(+, — ,x,^) and square root extraction. This extends the well-known 
Schwartz’s probabilistic test for the vanishing of polynomials. The prob- 
abilistic test forms the basis of a new theorem prover for conjectures 
about ruler & compass constructions. Our implementation uses the Core 
Library which can perform exact comparison for radical expressions. 
Some experimental results are presented. 



1 Introduction 

Several approaches to proving theorems in Elementary Geometry using con- 
structive methods in Computer Algebra were proposed in the 1980s |7j. These 
were much more successful than earlier approaches based on purely logical or 
axiomatic approaches. Thus, Kutzler, Stifter d and Kapur m proposed meth- 
ods based on Grobner Bases. Carra and Gallo |1I8| devised a method using the 
dimension underlying the algebraic variety. Hong m introduced semi-numerical 
methods (“proof by example” techniques) based on gap theorems. An acclaimed 
approach in this area is due to Wu who applied the concept of charac- 

teristic sets to geometric theorem proving. Extensive experimentation with Wu’s 
method were reported by Chou m- 

All these algebraic approaches begin by translating the geometric statements 
into algebraic ones. A proposed geometry theorem (also called a conjecture) is 
translated algebraically into two parts: a system H of multivariate polynomials 
called the hypothesis, and a single polynomial T called the thesis. The conjecture 
is true if the vanishing of the hypothesis system implies the vanishing of the thesis 
polynomial. From the viewpoint of algebraic geometry, proving the conjecture 
amounts to showing that Var{H) C Var{T) where Var{S) is the algebraic 
variety defined by a set S of polynomials. This basic formulation must be refined 
in order to handle degeneracy conditions. 

* This work was performed while Daniela Tulone was at NYU. 

** This work is supported in part by NSF Grant #CCR 9402464. 



J. Richter-Gebert and D. Wang (Eds.): ADG 2000, LNAI 2061, pp. 58-|^^ 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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Wu’s “basic method” computes the pseudo-remainder of the polynomial the- 
sis with respect to the Wu-Ritt extended characteristic set of the hypotheses sys- 
tem. If the pseudo-remainder vanishes, then the conjecture is true provided the 
initials of the extended characteristic set do not vanish. Wu’s basic method has 
been successfully used to prove many classical and some new theorems in plane 
analytic geometry. The basic method fails if the variety Var{H) is reducible. 
To handle this, Wu’s “complete method” begins by decomposing Var{H) into 
irreducible components and applying the basic method to each component. A 
drawback in Wu’s method is that it works with an algebraically closed field. 
In particular, it is not a complete method for the real algebraic varieties. The 
present paper addresses a special case of real algebraic varieties. 

Grobner bases methods can be doubly exponential in the worst case |i 7fy4j 
The complexity for Wu’s method is somewhat better but remains an issue. To 
circumvent the high complexity, we investigate probabilistic methods m com- 
bined with “proof by example” techniques nn . In probabilistic theorem proving, 
we do not prove the validity of a conjecture in the classical sense. Instead, we 
either prove the invalidity of a conjecture (by showing a counter example) or 
else classify the conjecture as “true with the high probability 1 — e”. This latter 
classification must be properly understood since, classically, it is nonsense to say 
that a theorem is true with some probability. What is meant is that, relative to 
a set of experiments we conduct, the probability that the conjecture is false and 
we failed to discover this, is less than e. 

An interesting approach along these lines was given by Carra, Gallo and 
Gennaro |2|. They applied the Schwartz-Zippel i?iim probabilistic test for the 
vanishing of pseudo-remainders in Wu’s method. They considered conjectures in 
the classical setting of ruler & compass constructions. Such conjectures are ex- 
amined by testing the vanishing of Wu’s pseudo-remainder for randomly chosen 
examples. Each example is specified by a random choice of values for its param- 
eters. The random choices come from some suitable test set whose cardinality 
depends on the degree of the pseudo-remainder. The extended characteristic set 
as well as the pseudo-remainder are computed. If the pseudo-remainder is zero, 
then the example is successful; otherwise, as in Wu’s method, further investiga- 
tion is called for. While implementing their method, one of us (D.T.) discovered 
a serious efficiency issue. The degree of the pseudo-remainder is very high: if the 
conjecture involves C ruler & compass construction steps, then, the degree of 
the pseudo-remainder in P] (following |9f I Dj ) has the following bound: 



The cardinality of the test set is 2D, which is too large in practice. This bound 
applies to the test for “generic truth” . For “universal truth” , D can be improved 
to 2P ■ where P is the number of points in the construction. Unfortunately, 
practically no classical theorems are universal truths. 



Summary of New Results. (1) We develop an extension of the Schwartz- 
Zippel probabilistic zero test. While the Schwartz-Zippel test is applicable to 
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polynomials, we treat radical expressions by admitting the additional operations 
of division and square-roots. This adds considerable complexity to the proofs. 
Furthermore, for efficiency considerations, we use straight line programs to rep- 
resent radical expressions. The asymptotic time complexity of our probabilistic 
test is a low-order polynomial. Since radical expressions are common in many 
applications, we expect this new test to be generally useful. 

(2) We address the problem of computer proofs of geometric conjectures 
about ruler & compass constructions. The zero test of radical expressions is tailor 
fitted for this problem. Moreover, we combine randomness with the numerical 
approach of Hong to give additional efficiency. Thus, our approach appears to 
be intrinsically more efficient than previous general approaches (e.g., Wu’s or 
Grobner bases). 

(3) Our prover is implemented using the Core library |l bll ;-iil t)] . This is 

an unexpected application of our library, which was designed as a general C++- 
package to support the Exact Geometric Gomputation approach to robust 

algorithms. Preliminary experimental results are quite promising. We expect 
further improvements by fine-tuning our library for this specific application. 
Our prover is currently distributed with version 1.3 of the Core library (Aug. 
15, 2000) and available from http://cs.nyu.edu/exact/core/. 



Overview. The paper is organized as follows: Section 2 gives an overview of 
geometric conjectures about ruler & compass constructions. Section 3 gives our 
extension of Schwartz’s probabilistic test to radical expressions. Section 4 ad- 
dresses the application of our new probabilistic test to theorem proving. We 
conclude in Section 5. 



2 Theorem Proving for Ruler &: Compass Constructions 

We follow the algebraic approach which has been well-summarized by Ghou 
0. Ruler & compass operations may be seen as constructing lengths, points, 
lines and circles, collectively called geometric objects. A collection of such geo- 
metric objects will be called a geometric scene. We consider geometric scenes 
that are constructed incrementally using ruler & compass operations. The al- 
gebraic analogue of constructing a geometric object O amounts to introducing 
a pair of variables (a:, y) and corresponding polynomial equations hi{x, y,z ,. . .) 
(z = 1, 2, . . .) that must be satisfied if (x, y) lies on O. Here, hi may involve other 
variables z, . . ., from previously constructed objects. We shall classify the vari- 
ables introduced by our constructions into two sorts: independent and dependent 
variables. For short, the independent variables will be called parameters. It is 
instructive to give a concrete example (Figure E from |S|). 

Example 1 (Pascal’s Theorem). Let A, B, C, D, F and E be six points on a 
circle centered at O. Let P = AB p| DF, Q = BC p| FE and S = CD fj EA. 
Show that P, Q and S are collinear. 
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Fig. 1. Pascal’s Theorem. 

Let A = ( 0 , 0 ), O = (ui, 0 ), B = {xi,U2), C = {x2,u^), D = (x3,M4), 
F = (x4,M5), E = (X5,ue), P = (X7,xe), Q = (xg,xs), and S = (a;ii,xio). This 
gives the following equations for the hypotheses. 





Equation 


Geometry 


Remark 


hi 


xf — 2uiXi + M2 = 0 


[OA = OB] 


Introduces xi , M2 


h2 


X2 — 2uiX2 + M3 = 0 


[OA = OC] 


Introduces X2,M3 


h 3 


X3 — 2 miX 3 + M4 = 0 


[OA = OD] 


Introduces X3 , U4 


h 4 


X4 — 2miX4 + M5 = 0 


[OA = OF] 


Introduces X4 ,ms 




Xg — 2 miX 5 + Mg = 0 


[OA = OE] 


Introduces Xg,Mg 


he 


(m5 — M4)x7 + ( — X4 + X3)x6 + 
M4X4 — M5X3 = 0 


[P G DF] 


Introduces xe,X7 


hr 


M2X7 — xixg = 0 


[P G AB] 


Constrains xg, X7 


hs 


(ue - Ms)xg + (-X5 + X 4 )X 8 + 
M5X5 — ueX4 = 0 


[Q G EE] 


Introduces xs,xg 


hg 


(m 3 - M2)xg + (-X2 + Xi)x8 + 
U2X2 - M3X1 = 0 


[Q G BC] 


Constrains xs, xg 


hie 


: MgXii - X5X10 = 0 


[S G AE] 


Introduces xio,xn 


hii 


(m4 - M3)xii + (-X3 + X2)xio + 
M3X3 — M4X2 = 0 


[S G CD] 


Constrains xio,xn 



The conclusion that P,Q,S are collinear can be translated into the following 
polynomial: 



g = {xs- Xq)xxi + (-Xg + X 7 )xio + XgXg - X7X8 = 0 . 
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In general, we get a system of polynomial equations, h\ = h 2 = ■■■ = hi = 
0 where hi S K[ui, . . . , Um, a:i, • • • , a;„] (K is the field of real numbers), the 
Ml , ... , Um are parameters, and the x\, . . . ,Xn are dependent variables. The con- 
jecture has the form: 

(Vu,x)[/ii = /i 2 = • • • = ^<: = 0 ^ 5 = 0] (1) 

where u = (mi, . . . x = (xi, . . . ,x„) and g = 5 (u,x) G K[u,x]. 

Degeneracy and Generic Truth. A theorem of the form m is called a uni- 
versal truth. It turns out that the classical notion of theoremhood is more sub- 
tle, and this led Wu to formulate the notion of generic truth. We formalize it 
as follows: let Z\i, . . . , be predicates on the variables u, x. We call each Ai 
a non- degeneracy condition. The conjecture (CQ is generically true relative to 

{Ai,...,Ak} if 

(Vu,x)[Ai,A2,...,Afc,/ii = /i2 = ••• = = 0 ^ 5 = 0]. (2) 

Classical ruler-and-compass theorems are indeterminate in that they do not ex- 
plicitly specify the degenerate conditions. Hence part of “proving a classical the- 
orem” involves discovering a suitable set of non-degeneracy conditions. Hopefully 
the set is minimal is some sense (but not necessarily unique) . The simplest kind 
of non-degeneracy condition has the form 

A:dy^0 

where d is a polynomial. Call this the first kind of non-degeneracy condition. The 
degree of the A is equal to the total degree of d. If each Ai has degree di, then 
the degree of {A\, . . . , A}~} is Typical examples of the first kind of non- 

degeneracy may require two points to be distinct or two lines to be non-parallel. 
It is easy to see that both have degree 2. 

Example 1 (continued) . The non-degeneracy conditions require the intersection 
points P, S and Q be not at infinity. Equivalently, the following pairs of lines 
are not parallel: {AB^ DF}, {BC, FE}, {CD,EA}. So the degree of these non- 
degeneracy conditions is 6. 

Second Kind of Degeneracy. The second kind of non-degeneracy condition 
arises for theorems in the real field. For example, when we define a point by the 
intersection of two circles, we require that these two circles intersect. Or, when 
we define three collinear points A, B and C, we may require B to lie between 
the other two points. Such non-degeneracy conditions have the form 

A:d>0 



where d is a polynomial. We can modify this condition using a well-known trick: 

A' : 3z, d-z‘^ = 0 
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where z is a new variable. The existential quantifier on z can be pulled out as 
a prenex universal quantifier. Thus, we can formulate the conjecture as 

(Vu,x,z) {A',H ^ T). 

In practice, there may be other ways to handle this: in the Pascal example, such 
non-degeneracies demand that the parameters Uj (for j = 2, 3, 4, 5, 6 ) satisfy 
\uj\ A: |wi|- Our prover can handle non-degeneracy conditions of the second kind 
when put in this form. Indeed, in all the examples we looked at in |Sj, such a 
formulation is possible. 

Reduction to Radical Expressions. In a ruler & compass construction, each 
dependent variable is a radical function of the previously introduced variables. 
As exemplified by Pascal’s Theorem, all the dependent variables are introduced 
either (i) singly by a single equation (e.g., Xi is introduced by hi = 0) or (ii) 
in pairs by two equations (e.g., xq,X 7 are introduced by Hq = hr = 0). As all 
equations are at most quadratic, the Xi’s can be replaced by radical expres- 
sions involving the m^’s. Let G = G(u) be the radical expression after such a 
substitution into g(u,x). The universal truth conjecture (IQ now says 

(Vu)[G = 0], 

with an analogous statement for generic truth. Another issue arises: each radical 
is determined only up to a ± sign. Hence, if there are r radicals in G, we must 
replace G = 0 by the system of 2’’ radical expressions, Gi = G 2 = • • • = G 2 ^ = 0, 
in which each of the 2’' possible sign combinations are used. If a single function 
G*(u) is desired, we can use G* = The appearance of “2’’” in this 

expression may be disturbing from a complexity viewpoint. Several observations 
suggest that this is not serious in practice. First, r is typically small (r = 5 in 
Pascal’s theorem). Next, we can reduce the number of summands in G* from 
the worst case of 2'' terms. There are two ways this can happen: (A) Symmetries 
in the problem may arise so that many of the Gi’s can be omitted. (B) Certain 
sign combinations may be excluded by the nature of the construction and/or 
theorem so that G* may represent a sum of less than 2’’ radical expressions. 
In particular, using (A) and (B), we can always omit half of the summands in 
standard geometric theorems. Thus, 2’’“^ terms suffice in G*. 

Example 2 (Butterfly Theorem). We illustrate the reduction in the number of 
terms in G* using the Butterfly Theorem in jSl Example 2.4, p. 9]. The theorem 
concerns 4 co-circular points A, B, C and D. Let O be the center of this circle 
and E be the intersection of AC and BD. The points A, B,C, D, E form a 
“butterfly”. If the line perpendicular to OE and passing through E intersects 
the lines AD and BC at G and F (respectively), then the theorem says that 
segments EE and EG have the same length. There are 3 quadratic equations 
in formulating this theorem (so r = 3). In the construction described by Chou, 
the point E is placed at the origin (0,0) and O is placed at (ui,0). A is freely 
placed at (^ 2 ,^ 3 ). The point G is now completely determined, and has two 
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possible solutions. In one solution, C and A coincide, and the nature of the 
theorem excludes this case. Next, the points B is freely chosen on the circle (and 
this introduces one parameter). Again there are two possible solutions. But it 
is clear by symmetry that we can arbitrarily choose one of them without loss of 
generality. Therefore, G* only needs two terms (corresponding to choosing the 
2 solutions for D). ■ 

The fact that our prover can address theorems about real geometry is illus- 
trated by the following simple example. 

Example 3 (Triangle Bisectors). Let A, B, C be three non-linear points, and 
D be the intersection point of the angle bisectors of AA and AB in the triangle 
AABC. We want to prove that D must be on the bisector of AC in AABC. 




Fig. 2. Coincidence of three angle bisectors. 

Let A = (0,0), B = (mi,0), C = (u 2 ,U 3 ), D = {x 4 ^,x^). This gives the 
following equations for the hypotheses. 



Equation 


Geometry 


Remark 


hi : x'l — uf = 0, xi > 0 


[xi ^ ||Ai?||] 


Introduces xi 


h2 '■ X2 — U2 — U3 = 0, X2 > 0 


[X 2 ^ ||AC||] 


Introduces X2 


h3 : X3 — {ui — U2)'"‘ — u's = 0, X3 > 0 


[X 3 ^ ||i?C||] 


Introduces X3 


hi : {x\U2 — X2U\)xa + X1U3X5 = 0 


[D G hisector{AA)] 


Constrains xa,X3 


. ^ [{U2 - Ui)xi + U1X3KX4 - Ui) + 


[D G hisector{AB)\ 


Constrains xa,X3 



The conclusion that D is on the bisector of angle AC can be formulated as 
the following thesis: 



g = {xa- X2){uiX2 - U2X2 + U2X3) - (X 5 - X3){X3 - X2)u3 = 0 

The formulation explicitly introduces inequalities for xi,X 2 ,X 3 to pick the 
internal angle bisectors. When regarded as a complex theorem, no such inequal- 
ities are allowed. In this case, each “bisector” can refer to either the internal 
or external bisector of an angle, so there are a total of 8 = 2^ choices for these 
bisectors. The “thesis” is true for exactly four of these choices, which also means 
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that the theorem is false in complex geometry. Let G(u) be the radical expres- 
sion after eliminating the dependent variables from g. The 8 choices of bisectors 
correspond to different assignment of signs to the three radicals in G(u). Our 
prover can be used to test the validity of each choice. ■ 

3 Randomized Zero Testing for Radical Expressions 

3.1 Straight Line Programs 

We need to generalize expressions to straight line programs (SLP). A SLP tt is 
a sequence of steps where each step is an assignment to a new programming 
variable. The fth step of a SLP has one of the forms 



Zi-i^xtoyi, (o e {-K, x,d-}) (3) 

Zi f— \/a?i (4) 

where Zi is a newly introduced programming variable, Xi and yi are either real 
constants, input variables or programming variables introduced in some earlier 
steps. Alternatively, we call an input variable an independent variable (or, pa- 
rameter) and a programming variable a dependent variable. These Xi and yi are 
said to be used in the Ah step. The last introduced variable is called the main 
variable and it is never used. In general, a SLP can have branching steps. But 
this possibility is not considered in this paper. 

An expression is a SLP where, with the exception of the main variable, each 
programming variable is used exactly once. Underlying each SLP is a labeled and 
ordered dag (directed acyclic graph) defined in the obvious way: each node corre- 
sponds to a constant or variable in the SLP. We often use the terms “nodes” and 
“variables” interchangeably. For the steps in (0) (resp., O), we introduce edges 
that are directed from Xi (resp., Xi and yi) to Zi. We use standard graph-theoretic 
terminology to talk about this dag: sinks, sources, predecessor/successor nodes, 
etc. If (u, v) is an edge of the dag, we call u the predecessor of v, and call v the 
successor oi u. The nodes labeled by input variables or constants are source nodes 
while the non-source are labeled by programming variables. The sources may be 
called leaves in case the dag is a tree. The non-source nodes are associated with 
an operation (±, x , ^^f■) - so we may speak of “radical nodes” , “multiplication 

nodes”, etc. Variables that are not used correspond to sink nodes in the dag. The 
main variable corresponds to a sink node which we call root. The radical depth of 
a node u is the maximum number of radical nodes in a path from u to any root 
node, inclusive of the end points. Thus, if u is a radical node, then the radical 
depth of u is at least 1. For each node u, its induced dag is the subdag comprising 
all the nodes that can reach u along a path. A SLP is said to be rooted if the 
root is the unique sink. The dags corresponding to expressions are ordered trees 
(hence rooted). Our SLP’s are assumed rooted unless otherwise noted. 

Values. Let u = (ui,...,Wm) be the input variables. For each variable u in 
a SLP 7T, we inductively define its value to be an appropriate element valjriu) 
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in an algebraic extension of Q(u). The extension is obtained by adjunction of 
square roots. The value of tt is the value of its main variable. More precisely, let 
Qo = Q(u) and define the tower of extensions defined by tt to be 

<5o C Qi C g2 C • • • C (5) 

where Qi := Qi-i(y^) and the fth square-root in tt has operand Oj € Qi-i- A 
SLP TT is also said to compute a collection V C Q^. of values provided each v GV 
is the value of some variable in tt. 

Rational Degrees. Let a; be a node in a SLP tt. We define the rational degree 
rdeg.„.(a;) of x (the subscript tt is usually dropped). We need some auxiliary no- 
tions. For any node or variable x, let RAD(cc) denote the set of radical nodes in 
the subdag of tt rooted at x. Write RAD(x,i/) for RAD(a;) \RAD(y) (set differ- 
ence). Also let p(x) := I RAD(x)| and p(a;, j/) := | RAD(x, y) |. We will inductively 
define rdeg(x) to be a pair of natural numbers (a,b) G N^, but usually write it 
as “a : 5”. These two numbers are the “upper” and “lower” degrees of x and 
denoted udeg(x) and Ideg(cc). Thus, 

rdeg(x) = udeg(x) : Ideg(cc). 



Assuming rdeg(cc) = ax ■ bx and rdeg(j/) = ay : by, we inductively define rdeg(z) 
using the table: 



z 


udeg(z) 


Ideg(z) 


constant 


0 


0 


parameter 


1 


0 


X X y 




bx2P^y’^'> + by2P^^’yi 


xGry 


ax2P^V’^l + by2P^^^yl 


bx2P^y^^i + ay2P(^’yl 


x±y 


max(a,^2P(2^’“^l -h -h ay2P^^^yl) 


bx2P^y’^i + by2P‘-^’y'> 


yfx 







The rational degree of the SLP tt is defined to be a : 6 where a = max^, udeg(a;), 
b = maxx Ideg(x), and x ranges over the nodes in tt. Note that if tt is division- free, 
then ldeg(a;) = 0 for all x. 



Alternative Approach. It is useful to have an alternative approach to rdeg 
which does not involve p{x,y) or p{y,x). In particular, we define rdeg 2 (^:) = 
udeg 2 ( 2 ) : ldeg 2 (z) inductively using the following table: as before, we assume 

rdeg 2 (x) = ax ■■ bx and rdeg 2 (y) = ay : by. 



Z 


udeg2(^) 


ldeg2(z) 


constant 


0 


0 


parameter 


1 


0 


X X y 


Q^x “1” 


bx + by 


xGry 


ax + by 


bx + ay 


x±y 


maxjaa; -I- by, bx + ay} 


bx + by 


Vx 


O-x 

2 


2 
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Notice that these degrees are no longer natural numbers but binary fractions. 
The following lemma gives the connection between the two definitions of rdeg. 

Lemma 1. For any variable z in a SLP, we have 

udeg(z) = 2^^^) udeg 2 (z), Ideg(z) = ldeg 2 (z). 



3.2 Equivalent Transformations 

Two variables (resp. SLP’s) are said to be equivalent if they have the same 
value. Transformations of an SLP that do not change its value are called equiv- 
alent transformations (but the set of computed values may change). Equivalent 
transformations may change the rational degree, as when applying the distribu- 
tive law: 

z{x y) ^ zx zy . (6) 

It is easy to verify that the rational degree of the left-hand side is at most that 
of the right-hand side. We next show that the rational degree is preserved in the 
absence of division (but allowing radicals): 

Lemma 2. If tt is division- free, then the transformation (0) preserves rdeg of 
TT. In particular, 

rdeg( 0 (a: -|- y)) = ideg{zx -\- zy). 

Proof. We only need to consider the upper degrees. With udeg(a:) = Ox, etc, as 
before, we have 

udeg( 2 (a; -f y)) = -f max{a,,2^(*^’“), 



while 

udeg(^:r + zy) = max{a,,2'’("^'""), 

= max{(a^2^(^’^) -f (a^2'’(^’^) -f 

The lemma follows if we now verify the following: 

RAD(a:y, z) = RAD(a;, z) ttJ RAD(zy, zx), 

RAD(xy, z) = RAD(y, z) ttl RAD(zcc, zy), 

RAD(z, xy) l±l RAD(y, a;) = RAD(z, x) ttl RAD(zy, zx), 

RAD(z, xy) ttl RAD(a:, y) = RAD(z, y) ttl RAD(zcc, zy). 

Our notation here, A ttl i3, refers to disjoint union of the sets A and B. Let us 
only prove the first equation: the RHS is equivalent to RAD(x, z) ttl RAD (y, zx). 
We may verify that the union is indeed disjoint, and equal to RAD(a;y, z). The 
other equations can be proved similarly. We omit the details here. ■ 

Next, we show that applying the associative laws for multiplication and addition 
does not affect rational degree. This follows from the following general result: 
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Lemma 3. Let Xi he variables in tt and Vi = | RAD(a;i, . . . , Xfc) \RAD(a:i)|. 
Then 



k 

rdeg(]^a:i) 

k 

udeg(^Xi) 

k 

ldeg(^a;i) 



k 

^rdeg(a;i)2’'‘ 

k 

m^{udeg(a;i)2’’‘ + ldeg(a;,)2’'^ } 

i—1 

j=i jA* 
k 

^ldeg(x*)2’'* 

i=l 



The above lemma justifies a generalization of SLP’s in which we allow addi- 
tion nodes and multiplication nodes to take an arbitrary number of arguments. 
These are called “sum” or ^-nodes, and “product” or fj[-nodes, respectively. 
Such an SLP is called a generalized SLP. A path in a generalized SLP dag is 
said to be alternating if along the path, no two consecutive nodes are ^-nodes 
and no two consecutive nodes are fj[-nodes. The SLP is alternating if every path 
is alternating. Clearly, any SLP can be made alternating without changing its 
rational degree. We can eliminating any non-alternating path in the SLP by 
aggregating the consecutive additions (or multiplications) using the X) (or 0) 
operations. This process will terminate because each elimination reduces the 
number if nodes in a SLP. 



3.3 Preparation 

A SLP in which the last three steps has the form 



X ^ y/wc 
y ^ X X wb 

z -^y + WA 

is said to be prepared (or in prepared form). Here wa,wb,wc are variables or 
constants. Thus z is the main variable, and x is the last radical variable to be 
introduced. Intuitively, the radical x has been brought up as close to the root 
as possible, in preparation for a transformation (to be introduced) to remove 
the radical. We also call x the prepared variable. If the values of wa,wb,wc 
are given by the expressions A,B,C (resp.) then the value of z is given by the 
expression 

a + bVc. 

Note special forms of this expression when A = 0 or i? = 1, or both. If the SLP 
has no square roots, it is considered prepared already. Our goal is to prepare a 
given SLP, and to bound the resulting rational degree. 
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Let us now prepare a radical node Aq with radical depth 1. Assume the SLP 
is division- free. Let A„, be expressions (n > 0). The expressions E„, for n > 0 

is defined inductively as follows: Eq = Aq x Bq, and for n > 1, 

En = (En-i + A„)Bn = ((En-2 + -|- A„)B„ = ■ ■ ■ . 

To show the dependence of En on the A„,’s and Bn’s, we may also write En = 
En(Ao, Bg, Ai, Bi, . . . , A„, Bn). Viewed as a tree, E„ is essentially a single al- 
ternating path from the root down to Ag. This path is left-branching only and 
the root is a x-node. Also write: S(„) := n;=oB,. 

Lemma 4. For n > 1, the expression En{Ag, Bg, . . . , An, Bn) is equivalent to 
the expression 



' — (^0 X ^(n)) + En-l{Ai, B\, . . . , An, Bn) 

Moreover, if En is division- free, then rdeg(if„) = rdeg(if^). 

Proof. Proof by induction. When n = 1, 

El = (Aq X Bg) Ai) X Bi 

= (Aq X Bg X Bi) -I- Ai X Bi. 

Assume that this lemma is held for n < k, then for n = k 1, 

Ek+i = {Ek Afc+i) X Bk+i 

= ((Aq X B(fc)) -l- Ek-i{Ai, Bi, . . . , Ak, Bk) Ak+i) x Bk+i 
= (Aq X B(^k+i)) + Ek{A\, Bi, . . . , Ak+i, Bk+i). 

Thus we know the equivalence of this transformation is held for any n £ N. 

In both cases, we only apply the distributive and associative laws, which do 
not change the rational degree when En is division free. ■ 

This is illustrated in the case n = 2 by Figure El Note that the variable Ag 
is prepared in Actually, En in this lemma can be a generalized SLP so that 
the Ai,Bi’s need not be distinct and the nodes can be and f|-nodes. Then 
there is a corresponding equivalent SLP E'n, this is the version that we will use 
in the next theorem. 

We address the problem of multiple uses of a node. A node u is used k times 
if there are k distinct paths from the root to u. If a radical node u of radical 
depth 1 is used k times, then if we judiciously apply the previous lemma k times, 
each time eliminating one “use” of u, we obtain: 

Theorem 1. Suppose tt is a division-free SLP and u is a radical node in tt with 
radical depth of 1 . Then we can transform tt into an equivalent SLP tt' such that 
udeg(7r) = udeg(Tr'). Moreover, either no node in tt' has the value valT^fu) or 
else, there is a node u' in tt' with the following properties: 
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Fig. 3. The Transformation E2 E'2. 



1. u' is the prepared variable in tt' 

2. u' is the unique node in n' with value valT^{u). 

Proof. We may assume that tt is a generalized, alternating SLP. Fix any path p 
from u to the root and we may assume that this alternating sum-product path 
has the same form as the path from Aq to the root of En in lemma 4. We then 
apply the previous lemma in which u now plays the role of the node Aq in En- 
This collapses the path p to length 2, as in the lemma and the resulting SLP is 
in a prepared form E' = u x A + B. li the variable u is used in A and/or B, 
then we can repeat this process for another path p' (if any) in A or B. We can 
repeat this process for the subexpressions A and/or B, if they contain references 
to the node u as well. There are two cases: 

1. M is used in A, then A is transformed to A' = u x A\ + B\ and E' = 
uxBi + {Aiu^+B). Remember that m is a square root and thus the expression 

effectively eliminates the square root operation here; 

2. u is used in B, then B is transformed to B' = u x A 2 + B 2 and E' = 
u X (A -|- A2) -f -62- 

In both cases, we can see that A' is still in a prepared form. We keep this process 
until there is no use of u except the one that is in the prepared position and has 
a unique path to the root with length 2. Since there must be a finite number 
of uses of u, this iterative process will eventually terminate. At that point, the 
resulting SLP tt' has the desired form: tt' is prepared and u is the main prepared 
variable. It is also clear that if there are other nodes with the same value as u, 
they can also be merged with u by the same process. Hence, u will be the unique 
node with value valTr{u). 

Note that we apply the commutative, associative and distributive laws in 
these transformations. The commutative and associative transformations do not 
change the rational degree. Since tt is division free, LemmaEltells us that the dis- 
tributive transformation preserves the rational degree too. Therefore, the prepa- 
ration transformation does not change the rational degree of tt. ■ 

We say that tt' is obtained by the process of “preparing” u in tt. 
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3.4 Main Result 

Let 7T be a SLP whose value is P = P(u) G (see Q). We define the real 
function : R™ — >■ M where , Om) is the value of the main variable in 

7T when we evaluate each dependent variable at a = (oi, . . . , am) G R™, following 
7T in a step-by-step fashion. The domain of comprises those a G R™ where 
/^(a) is defined. Similarly, we define an associated real function fv ■ R™ — >■ R. 
Note that the domain of is always a subset of fv- The following example 
shows that it may be a proper subset: let tt compute the value V = 
using Horner’s rule, and let tt' compute the same V using the formula V = 

Then tt and tt' are equivalent, but 7t(1) = n while 7t'(1) is undefined. The domain 
of TT (and V) is R but the domain of tt' is R — {1}. 

Theorem 2. Suppose V = V (u) is the non-zero value of a rooted division-free 
SLP 7T. Then there exists a non-zero polynomial P{u) such that Zero(V) C 
Zero(P) with degP(u) < udeg(7r). 



Proof. We show the existence of the polynomial H(u) by induction on the num- 
ber r of square roots in tt. For r = 0, the result holds because V is already a 
polynomial of degree udeg(7r). 

Assume r > 0 and let u be a radical node of radical depth 1 in tt. We 
prepare u, leading to an equivalent SLP (which we still call tt). The udeg of tt 
is unchanged by this transformation. If C is the value of u, then the value of tt 
can be written as 

v = a + bVc 

where A, B, C belongs to Qr-\ (recall that values of programming variable in- 
troduced before the rth root extraction belongs to the field Qr-i, by definition 
of Qr-i)- li B — 0 then V = A and the result is true by the inductive hypothesis 
applied to A (which has < r — 1 square roots). Otherwise, by applying some 
further (obvious) transformations, we transform tt to some tt' whose value is 

V' = A^ - B^C. (7) 

Note that tt' has < r — 1 square-roots. If W = 0 then 0 = V = {A -\- B'/C){A — 
B'/C). Since Qr is a UFD and V = A-hB\/C 0 (by assumption), we conclude 
that A-By/C = 0, i.e., yfC = A/B G Qr-i- Thus V = A + By/C = 2A. Then V 
can be computed by some SLP with < r — 1 square-roots, and the result follows 
by inductive hypothesis. 

So assume V yf 0. By induction, Zero(P') = Zero(A^ — B^C) C Zero(P) 
for some P with deg(P) < udeg(P'). Since Zero(P) C Zero(P'), it remains to 
show that udeg(P') < udeg(P). We have 



udeg(P) = udeg(A -I- ByfC) 

= max {udeg(A)2^(^'^’^),udeg(H\/C)2'’(^-®^)} 



> max{udeg(A)2i+p(^'^’'^), 



udeg(H)2^('^’^) -h udeg(C)2'’('®’'^) 



2p(a.b=c)| 
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= max{2udeg(A)2^^'® 



^degJ^') 2i+p(g.^^)+udeg(C)2P(^^g) 



2P(A,B^C)y 



> ma.x{udeg{A^)2P^^^^’^\ udeg(s2)2'>('^>^') + udeg(C)2^(^"’^) 

= udeg(A 2 - B"^C) = udeg(l^')- 



3.5 Presence of Division 



What if the SLP is not division-free? Note that the presence of division is very 
common. For instance, when we intersect two lines in the construction, it gives 
rise to an expression with division. There is a well-known transformation to 
move all divisions towards the root, merging them as we go. An instance of this 
transformation is 

A , A' ^ AB' + A'B 
B^ 1^ ^ BB' 

Unfortunately, the number of radical nodes may be doubled because if we move 
a division node past a radical node, we obtain two radical nodes: 



[a 4a 



( 8 ) 



Hence we give two versions of this transformation in the following lemma: in 
version (i) we do not move any division node past a radical node, and in version 
(ii) we remove all but at most one division node. 



Lemma 5 (Elimination of Division). Lei tt be a rooted SLP. 

(i) There is an equivalent SLP ■k' in whieh eaeh division node is either the root 
of 7T or the child of a radical node. Moreover, rdeg(Tr') = rdeg(7r) and n' has the 
same number of radical nodes as tt. 

(ii) There is an equivalent SLP tt" with only one division node which is also the 
root. Ln this case rdeg(7r") < 2’'rdeg(7r). 



The proof of (ii) exploits the alternative definition of udeg(rt). Because the jus- 
tification of the alternative definition is long, we only refer to the details in 

US' 

The value of the SLP tt” has the form A/B where A, B are division- free. 
Intuitively, to check if A/B — 0, we check if A = 0 subject to B ^ 0. Since A 
is division- free, we may apply main theorem (see next Section). This effectively 
amounts to doubling the number of square roots to prove a theorem involving 
division. 



3.6 Improved Square Root Transformation 

It turns out that we can exploit another trick motivated by HE! in order to avoid 
the doubling of the number of square roots. Instead of (jSj), we use the following 
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transformation to extract division out of square roots: 




^ if udeg{A) > udeg(B), 



A 

VaS 



if udeg(A) < udeg(B). 



(9) 



Suppose our transformations for eliminating divisions, using the new rule 
transform an arbitrary expression 2 into U{z)/L{z) where U{z), L(z) are divi- 
sion free. Let and denote the udeg(C/(z)) and udeg(L(z)). To exploit the 
advantages of this new rule, we now give an explicit set of inductive rules for 
computing Uz and Iz'- 



z 


Uz 


Iz 


constant 


0 


0 


parameter 


1 


0 


X X y 


Ux lly 


^x “1” 


x^y 


'^x ly 


lx ^y 


x±y 


max{ua; -1- ly, G -1- Uy} 


lx “1“ ly 


\fx 


“1“ i'^X ^ ^x)‘l 

Uxj {Ux ^a:)' 


lx: (j^x ^ 1x^1 

Ix^-i {Ux ^ lx)- 



Note that PS| only uses one of two clauses in 0 unconditionally. But the 
effect of using the two conditional clauses is that the resulting bound Uz is never 
worse than 2’' udeg(z), which is the bound in LemmaEl The proofs may be found 

in [Tnj . 

4 Proving by Random Examples 

We show how to use our main result to prove theorems about ruler & compass 
constructions. According to Section 2, this amounts to verifying if a radical 
expression G*(u) is identically zero (subject to non-degeneracy conditions). Let 
7 t ( u ) be the natural SLP which computes the values of all the dependent variables 
in a ruler & compass construction, and whose value is the polynomial thesis 
G*(u). We give a simple upper estimate on the rdeg of each node in tt. 

Each “stage” of our construction introduces new points, lines or circles. Let 
us now be more precise: assume that our system maintains three kinds of geo- 
metric objects: points, lines and circles. These are constructed as follows: 

— Points: There are three cases. Case 0: We can introduce an arbitrary point, 
P. Then P.x and P.y are free variables (i.e., parameters). Case 1: We can 
introduce an arbitrary point, P on an existing line L or circle G. We may 
specify either P.x or P.y to be a parameter. The other coordinate is therefore 
a dependent variable, constrained by an equation. Case 2: We can introduce 
a point P that arises from the intersection of a line/circle with another 
line/circle. In this case, P.x and P.y are both dependent variables constrained 
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by a pair of simultaneous equations. There is a variation of Case 2, which 
arises when at least one of the two intersecting objects is a circle. In this 
case, we allow the user to obtain both the points of intersectiouQ. 

— Lines: Given two existing points, we can construct the line through them. 

— Circles: Given three points P,Q,R, we can construct the circle centered at 
P of radius equal to the distance between Q and R. As a special case, if P 
is equal to Q or R, we can just use two arguments for this construction. 



Lemma 6. If the dependent variable x is introdueed at stage i , then rdeg 2 (a:) < 
85®, i.e., udeg 2 (a;) < 85®, ldeg 2 (a:) < 85®. 



Proof Proof by induction. Let Sk be the set of objects (points, lines, etc.) avail- 
able after k construction stages. This lemma is trivially true when k = 0 because 
So is empty. 

Let rfc = 85^. By the induction hypothesis, we assume that the coordinate 
(e.g., for points) or coefficient (e.g., in a line or circle equation) variables for all 
the objects in Sk have rational degrees at most 

Let us first consider the construction of lines and circles. Recall that in our 
system, a line refers to one that is constructed by linking two points in Sk', while 
a circle means one that is constructed with the center in Sk and the radius being 
the length of some segment between two points in Sk ■ We represent a line by a 
linear equation oa; -I- 6?/ -I- c = 0. It is easily verified that the rational degrees of 
a, b and c are at most 2rk,2rk and 6rfc, respectively. Similarly, we represent a 
circle by an equation in the form of (x — a)^ + (y — b)^ = where the rational 
degrees of a,b and c are at most rk,rk and respectively. 

Next, we consider the construction of points. As discussed above, we can have 
one of the three types of construction (Cases 0, 1, 2) in stage (fc -I- 1). Case 0 is 
trivial because all the parameters have the rational degree 1:0. Case 1 can be 
viewed as a simplified Case 2. In the following, we focus on the more interesting 
Case 2 constructions. 

There are three possible constructions in a Case 2 stage. 

First, we consider the intersection of two lines Li : a±x + b\y -I- ci = 0 and 
L 2 : Q 2 X + & 2 y -I- C 2 = 0 where a’s, b’s and c’s can be at most r^. We obtain the 
intersection point {x,y) of these two lines as follows. 



, Cl 62 
^0162 



C2&1 Cl 02 — C20i 

0261 ’ 02^1 — «1^2 



)• 



From the definition (see Section [8.111 . the rational degrees for x and y are at 
most 8rfc. 

Next, let us consider the intersection of a line L : aix + biy + ci = 0 and a 
circle C : {x — 02 )^ + {y — b 2 )^ = c|. We eliminate y and get a quadratic equation 

^ It should be possible to allow the user to pick one of the two points using some 
criteria, but we defer this to a future paper on implementation. This additional 
power is sometimes needed in ruler-and-compass theorems. 
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for X as follows: 




Let A, B and C be the three coefficients in the above equation. It can be shown 
that the rational degrees of them can at most be 4rfc, and lOr^ respectively. 
From the above equations, we get x = and y = _ . Thus, 

rdeg 2 (x) < 23rfe and rdeg 2 (y) < 26rfe. 

Thirdly, we consider the intersection of two circles: C\ : (x — ai)^ + (j/ — 6i)^ = 
cf and C 2 : (a:: — 02 )^ + (j/ — ^ 2 )^ = c^. We subtract them first to obtain a linear 
equation first. Then by arguments similar to those used for the intersection of a 
line and a circle, we can show that the rational degrees for x and y are at most 
69rfe and 85rfc, respectively. 

Therefore, we know that rdeg 2 (a:) < 85* for all the nodes at the stage i. ■ 

REMARK: The constant 85 in the above lemma is clearly very conservative. 
This bound can be refined, for example, by classifying the stages into the various 
types of construction. 

Corollary 1. Let the thesis polynomial be g(u,x) with deg( 5 ) = d, and G(u) be 
any of the 2*' radieal expressions derived from g{u, x) by eliminating dependent 
variables. Then ideg^{G) < td2^85^ where g(u, x) has t terms and k is the 
number of eonstruction stages. 

Proof. For Lemma El we know that the rational degrees for all the dependent 
and independent variables are at most 85^. The thesis G has t terms with total 
degree at most d. By the inductive definitions of rational degrees, we know that 
rdeg,,(G) < td2’'85'=. ■ 

Assume an incremental construction with m parameters, n dependent vari- 
ables, k stages, and r quadratic equations. Note that t is at most 
Moreover, d < 2 in most classical geometric theorems. In our implementation, 
instead of relying on this crude upper bound, we actually compute the actual 
bounds on rdeg to achieve better performance. By applying Lemma El^ii) to 7T, 
we obtain tt" with one division at the root, and rdeg(7r") < 2’'rdeg(7r). Now the 
value of 7 t" (which is G*) has the form A/ B where A, B are division- free. More- 
over, rdeg„.//(G*) < td2^*’85^. Clearly, Zero(A/R) C Zero(A). Without loss of 
generality, assume A yf 0. By our main theorem, Zero(A) C Zero(P) for some 
polynomial P of degree < fd2^’’85*^. Then we invoke a simple form of Schwartz’s 
lemma: 

Fact 1. Let P{u) be a non-zero polynomial of degree at most D. Lf eaeh Oi 
(i = 1, . . . , 771^ is randomly chosen from a finite set S' C K. Then the probability 
that P(oi, . . . , am) = 0 is at most D/\S\ . 

If we randomly pick the values a = (oi, . . . , Om) S S’”, and |S| = td2'’+^’'85^ (for 
any c > 1) then the “error probability” of our procedure is given by Pr{A(a) = 
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0} < Pr{P(a) = 0} < 2“'^. This constitutes our probabilistic verification of the 
universal truth of “G*(u) = 0”. 

An alternative to testing G*(u) = 0 is viewing the problem as testing the 
simultaneous vanishing of a set of polynomial t/ :={Gi(u), . . . , G 2 '-(u)}. This 
reduces the complexity in two ways: 

— The root bound (which determines the precision necessary to numerically 
determine the sign of radical expressions in the Core Library) is smaller. 

— The size of the test set S is smaller. 

We also have a further choice when testing Q: we can randomly choose some 
Gi to test for its vanishing, or we can choose to randomly test all the G^’s for 
their vanishing. However, the random choice of Gi does not seem to be the most 
efficient way to test a theorem. 



Degeneracies of the First Kind. We now address the generic truth of 
“G*(u) = 0”. The notion of “error probability” becomes an interesting issue. 
First consider only non-degeneracy conditions of the first kind, A : 6 ^ 0. For 
simplicity, assume the ith ruler & compass construction step introduces exactly 
one such condition, Si yf 0, of degree < 2. Since there are k stages of construc- 
tion, the non-degeneracy condition becomes S* := 8162 ■ ■ ■ Sk yf 0. The degree of 
S* is thus at most 2k. 

There are two natural models of what it means to have an “error probability” 
< 2“'^: (A) The “strict model” says that our sample space is now restricted to 
S”” \ {a : (f(a) = 0}. (B) Alternatively, we can say that the sample space is still 
S'™ but the theorem is trivially true at S™ fl {a : (5(a) = 0}. Given a finite test 
set S, the possible zeros of 5* (i.e., degenerate configurations) in S™ is at most 
2^” udeg(5*)|S|™“^. With a large enough test set S, we can make the probability 
that degenerate cases are chosen in the test (i.e., 2^” udeg(<5*)/|S|) arbitrarily 
small. We adopt the model A in the next theorem: 

Theorem 3. Conjectures about ruler & compass constructions with s non-dege- 
nerate conditions of the first kind can he verified with error probability < 2“"' 
in time polynomial in the parameters 2”, 2^, fc, c, lg(t) and lg((i), where r is the 
number of square roots in the thesis radical expression G(u), k is the number 
of construction stages, t is the number of monomials in the thesis polynomial 
(/(u, x), and d is the total degree of g. 

Proof. Each construction introduces a constant number of new operations into 
the final radical thesis expression G*(a). Thus, the cost to construct the the- 
sis expressions G*(a) is bound by 0{k). Next, let us consider the complex- 
ity in verifying G(a) for some sample configuration a = (oi, 02 , . . . a™) ran- 
domly chosen from a finite test set S with a cardinality of From 

the discussion above, we know that the failure probability of this test is at most 
2“°. Without loss of generality, we can assume all the elements in S are in- 
tegers. So the bit length of each instance value is bounded by L = Igd-SI) = 
0{r -\- c-\- lg(t) -I- lg(d) -I- fc). In our root bound based approach to determine 
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the exact sign of an algebraic expression m, the number of bits which need to 
compute in the verification is bounded by 0 {pL 2 ^''), where p is the total num- 
ber of operations in G* which is bounded by 0 {k). It is known that the time 
complexity of arithmetic operations among multiple precision numbers are no 
more than 0 {i^) where I is the bit length of operands. We have a total of 2'’ 
radical thesis expressions to verify. So the complexity to verify the vanishing of 
G* , when exact arithmetic is employed, is polynomial in 2’’, /c, c, lg(t) and lg(d). 

In presence of s non-degeneracy conditions of the first kind, let Z\(u) be the 
product of all of them. It is a radical expression in u. By our main theorems, the 
number of zeros of A in 5™, N, is polynomial in 2® and 2’’. In the worst case, 
we may meet at most N degenerate cases before we get the first non-degenerate 
one. So the worst case complexity for our complete method is polynomial in 
2’’,2®,fc, c, lg(t) and lg(d). ■ 



Degeneracies of the Second Kind. As noted, degeneracies of the second 
kind can often be reduced to simple constraints on the domains of the param- 
eters, possibly depending on the values of other parameters. For instance, we 
noted that in Pascal’s Theorem, the parameters Ui (f = 2 , . . . , 6) must satisfy 
\ui\ < |ui|. Our prover can handle such degeneracies by exploiting the following 
more general form of fact^ define the generalized degree of p{xi, . . . ,Xn) to be 
(di, . . . ,dn) where the degree of p is di when viewed as a polynomial in xi and 
its leading coefficient inductively has generalized degree (^2, . . . , dn). Suppose 
Si, . . . , Sn are finite sets of real numbers, then it can be shown that if we choose 
(ui, . . . , Un) randomly from Si x S2 x ■ ■ ■ x Sn, the probability that p is non-zero 
and p{ui , . . . , u„) = 0 is at most 

di dn 

l^ll l^nl' 

The main extra complexity caused by this version of our prover is that we 
need to evaluate the parameters at rational values (instead of just at integer 
values) . 

The current implementation does not handle the second kind of degeneracy in 
the above way, but we plan to rectify this in the future. Instead, it detects when 
an example a £ S'™ is degenerate, discards it and generates another example, 
etc. Under probability model (A) above, this means that we do not have an a 
priori bound on the running time, but the error probability is correct. Of course, 
under model (B), there is no need to generate another example; but this does 
not seem like a reasonable model. 



Degenerate Ruler-and- Compass Constructions. Certain theorems amount 
to detecting the validity of construction steps. We give a simple example from 
p] of a theorem true in real geometry but false in the complex geometry. The 
construction amounts to picking two points Pi (0,0) and P2(u,0) where u is a 
free parameter. Also let P3 be the midpoint of P1P2, and P4 the midpoint of 
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P1P3. Let L be the bisector of the segment P 1 P 2 , and C be the circle centered 
at Pi with radius P1L4. Let P5 be the intersection of L and C. The thesis is 
Pi = P 2 or equivalently u = 0. This conjecture is true in real geometry, but it is 
false in the complex plane because u = \/— 1 is a solution. This is an interesting 
example because the thesis does not depend on the construction at all. It is an 
indirect way of asserting the validity of the construction steps. In implementing 
a prover that takes inputs from the user, we need to guard against being asked 
to prove such theorems. This amounts to an extreme form of the second kind of 
degeneracy. 



Timing. The following table lists some theorems from Chou 0. However, the 
last row (Tri-Bisector theorem) is the real geometry example from Section 2. The 
timings are for two values of c (this means the probability of error is at most 
2~^). We also arbitrarily “perturb” the hypothesis of each theorem by randomly 
changing one coefficient of one of the input polynomials, and report their timings 
as well. These are all false theorems, naturally. Our tests were performed on a 
Sun UltraSPARC-IIi (440 MHz, 512 MB). The times are all in seconds, and 
represent the average of 6 runs each. The prover uses Core Library, Version 1.3. 
Actually, the library is directly modified so that we compute the exact rational 
degrees of the expressions (rather than use the estimates of the Lemma 0) . For 
comparison, we include the timings reported by Chou 0 using the approaches of 
Wu and of Grobner Bases. The final column in the table gives the page number 
in Chou’s book 0. 



No. 


Theorem 


c= 10 


c = 20 


Perturbed 


Char Set 


Grobner 


Page 


1 


Pappus 


0.020 


0.020 


0.007 


1.52 


33.32 


100 


2 


Pappus Point 


0.110 


0.113 


0.023 


4.87 


67.62 


100 


3 


Pappus-dual 


0.020 


0.020 


0.013 


1.45 


25.53 


111 


4 


Nehring 


8.300 


8.390 


0.107 


4.15 


159.3 


115 


5 


Chou-46 


0.070 


0.073 


0.020 


88.13 


37.65 


124 


6 


Ceva 


0.030 


0.033 


0.017 


1.12 


3.47 


264 


7 


Simson 


193.22 


262.49 


0.023 


1.22 


5.02 


240 


8 


Pascal 


1715.8 


2991.6 


0.037 


29.6 


>14400 


103 


9 


Tri-Bisector 


20.027 


38.350 


0.010 


- 


- 


- 



Let r be the number of square roots in the radical expression representing a 
theorem. If r = 0, we say the theorem is linear. A large par10of the 512 theorems 
in Chou’s book are linear. Only the last two theorems (Simson and Pascal) in 
the above list are non-linear, with r = 1 and r = 5, respectively. Evidently non- 
linear theorems represent a challenge for our current system. Recall that there 
are 2” (or 2”“^ by symmetry) possible sign assignments to the radicals in G(u). 

^ The theorems in Chou’s book include an original list of 366 theorems from 0, of 
which 219 are reported to be linear U P- 12]. 
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Our prover has three verification modes: (1) random mode, (2) exhaustive mode, 
and (3) specified mode. These correspond, respectively, to testing (1) a random 
sign assignment, (2) all sign assignments and (3) a user-specified assignment. For 
linear theorems, these modes are irrelevant. In the above table, we test Simson’s 
theorem in the exhaustive mode, Pascal’s theorem in the random mode and Tri- 
bisector in the specified mode. So our timing for Pascal’s theorem should really 
be multiplied by 2 '^ = 16. 

It is interesting to note that we have never observed a single wrong conclu- 
sion from our probabilistic tests - all true theorems are reported as true, and 
all perturbed theorems are reported as false. In some sense, that is not surpris- 
ing because the probabilistic bounds based on Schwartz’s lemma seem overly 
conservative in all real situations. 

The running times for linear theorems are pretty consistent across different 
runs. However, for the non-linear theorems, the timing can show much more 
variation. This is not unexpected since the running time depends on the bit size 
of the random example. A more prominent behavior comes from the clustering of 
times around certain values. For instance, for Simson (c = 20), the times cluster 
around 10 seconds and around 70 seconds. This “multimodal” behavior of the 
timings are again seen in Pascal. This can be attributed to the random choice 
of signs for the radicals in non-linear theorems. This may also account for the 
curious relative times for Simson c = 10 and c = 20. 

The performance of our library is critically dependent of good root bounds 
(an area of research that we are actively working on uni). It should be possible 
to exploit prover-specific techniques to improve the speed, but this has not been 
done. There are several issues to bear in mind when comparing our method with 
Wu’s method: 

— Chou’s timings would look considerably better using hardware available to- 
day. 

— The actual theorems proved by Wu’s method are not strictly comparable to 
ours in two important aspects: Wu’s method proves theorems about complex 
geometry while ours is about real geometry. On the other hand, Chou’s 
algorithm is deterministic while ours is probabilistic. 

— Our method is extremely effective for discarding wrong or perturbed con- 
jectures. It is unclear if Wu’s method will be much faster for perturbed 
theorems, since the algorithm would still have to execute the same basic 
steps. The ability to quickly reject false theorems is extremely useful in ap- 
plications where the user has many conjectures to check but most of the 
conjectures are likely to be false. 

— One of the strengths of Wu’s methods (as compared to Grobner bases, say) 
is its ability to discover non-degeneracy conditions. A similar capability is 
embedded in our approach ~ this simply amounts to detecting when a con- 
struction step is ill-defined. 
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5 Final Remarks 

In this paper, we have developed a generalization of the Schwartz-Zippel ran- 
domized zero test for the class of radical expressions. Such a test is expected to 
have many applications as radical expressions are quite common. Here, we focus 
on their use in proving theorems about ruler & compass constructions. Some 
features of our prover are: 

— It proves theorems about real (rather than complex) geometry, under the 
limitation that there is no inequalities appearing in the thesis. 

— It is probabilistic, so that speed can be traded-off against error probability. 

— It detects wrong conjectures very quickly. 

— It is extremely effective for linear theorems (the majority of the theorems in 

0 ). ^ ^ 

— It exploits the special nature of ruler & compass constructions. 

Because of the last feature, our approach may ultimately prove to be more 
efficient for this class of problems than other more general techniques. However, 
our results so far have not been conclusive in the case of non-linear theorems. 
The following are some open problems: 

— Improve our zero test for straight line programs that involve division. 

— Develop techniques to make our approach faster for non-linear theorems. 

— Extend our randomized techniques to theorems that have inequalities in the 
theses. This seems to call for radically new ideas. 
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Abstract. The aim of the present paper is the following: 

- Examine critically some features of the usual algebraic proof protocols, 
in particular the “test phase” that checks if a theorem is “true” or not, 
depending on the existence of a non-degenerate component on which it 
is true; this form of “truth” leads to paradoxes, that are analyzed both 
for real and complex theorems. 

- Generalize these proof tools to theorems on the real field; the gen- 
eralization relies on the construction of the real radical, and allows to 
consider inequalities in the statements. 

- Describe a tool that can be used to transform an algebraic proof valid 
for the complex field into a proof valid for the real field. 

- Describe a protocol, valid for both complex and real theorems, in which 
a statement is supplemented by an example; this protocol allows us to 
avoid most of the paradoxes. 



1 Introduction 

The methods of algebraic proof in Euclidean geometry, notably the Wu-Ritt 
characteristic set method and Kapur’s Grobner basis method USEEH, have 
been a striking success, as for example testified by jS|, in which hundreds of 
theorems in Euclidean geometry are proved. See also mm- 

These methods start from an informally and incompletely specified theorem 
(i.e. a theorem in which some implicit assumptions — e.g. the three vertices of 
a triangle are not aligned — are made) and, through a series of steps, obtain a 
formal statement of a theorem and a proof (or a disproof) of it. 

Some of the steps of an algebraic proof method can be fully automated 
through completely defined sub-algorithms, and some instead require human 
expertise, without excluding that an automatic equivalent may be obtained 
through expert systems. 

To specify a method of algebraic proof one has to specify the different steps, 
the algorithmic steps and the “expertise” steps, the languages in which the 
different steps are expressed, etc; we call such a specification a protocol. 

* Work partly performed within the Project “Tecniche per Immagini”, cluster C15, 
progetto n. 7 “Sviluppo, Analisi ed Implementazione di Metodi Matematici Avanzati 
per il Trattamento di Immagini”, with the contribution of MURST. 



J. Richter-Gebert and D. Wang (Eds.): ADG 2000, LNAI 2061, pp. O.')^ 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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A common feature of many such protocols is the following: the theorem 
hypothesis and thesis are translated into systems of equations between the coor- 
dinates of the geometric entities involved (usually points, since all entities may 
be reduced to a set of defining points); these equations define two affine alge- 
braic varieties, the hypothesis variety 'H and the thesis variety T; some of the 
components of the hypothesis variety are considered degenerate; a theorem is 
considered true if a non-degenerate component (or every non-degenerate com- 
ponent, depending on the protocol) of the hypothesis variety is contained in the 
thesis variety. The theorem is hence completed by adding to the hypothesis some 
non-degeneracy condition (usually in the form of an inequation) that allows to 
discard the components on which the thesis is not true. 

One of the most used proof protocols requires us to identify a construction; 
this is both a serious limitation, since many interesting theorems cannot be for- 
malized in this way, and human expertise is necessary to identify the construction 
steps: this is especially well explained in |Sj. 

The algebraic proof protocols in use have often two serious drawbacks: 

— it is possible to prove “intuitively false” theorems, i.e. theorems that are true 
only on a component whose points are all intuitively degenerate, and false 
on components whose points are intuitively non-degenerate; 

— the proofs are valid on an algebraically closed field (and not on the real field). 

For some classes of theorems the proof of a theorem on the real ground 
field, when the theorem is true on the complex field, is straightforward, but the 
problem of the identification of such classes, to our knowledge, has not been fully 
addressed. Moreover there are two classes of theorems on the reals for which the 
standard methods of algebraic proof are useless: 

— theorems that are false on the complex field; 

— theorems for which the statement is impossible over the complex field, since 
the statement uses the ordering in the real field. 

For these classes of theorems other protocols have been developed, for ex- 
ample real quantifier elimination [muni; these methods, however, are often 
much slower than the purely algebraic methods that use Wu-Ritt characteristic 
sets or Grobner bases. 

The aim of the present paper is the following: 

— Examine critically some features of the usual algebraic proof protocols, in 
particular the “test phase” that checks if a theorem is “true” or not, de- 
pending on the existence of a non-degenerate component on which it is true; 
this form of “truth” leads to paradoxes, that are analyzed both for real and 
complex theorems. 

— Generalize these proof tools to theorems on the real field; the generalization 
relies on the construction of the real radical, both of ideals and of semi-ideals, 
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the generalization to systems of equations and inequalities to the concept of 
ideal of a polynomial ring for a system of equations. 

— Describe a tool that can be used to transform an algebraic proof valid for the 
complex field into a proof valid for the real field. This tool may fail, without 
giving a proof that the theorem is false on the real field. 

— Describe a protocol, valid for both complex and real theorems, in which a 
statement is supplemented by an example, and allows us to avoid most of the 
paradoxes. This protocol can prove a true theorem, but may fail to disprove 
a false theorem. 

— Give some hints of a more difficult problem concerning real theorems that 
are algebraically false but for which a semialgebraic completion and a corre- 
sponding proof might be automatically determined. 

We do not discuss the choice of an algorithm for the steps of the protocols 
described; hence, for example, we do not take a position between different char- 
acteristic set or Grobner basis methods, as far as they are usable to perform some 
commutative algebra operation such as discovering the existence of a component 
of an affine algebraic variety with some property needed by a protocol. The only 
exception is a sketch of an algorithm for the computation of the real radical, 
since practically efficient algorithms for this computation are not present in the 
literature. 

Many of the items discussed in the paper are not new; many issues are already 
discussed in as well as in more recent literature; our contribution proposes 

a different accent that might help expanding the automatic deduction methods 
into some of the less explored regions of algebraic and semialgebraic theorems, 
notably the theorems that do not correspond to a construction. In particular, we 
are as much interested in disproving false “theorems” (and avoiding to “prove” 
false “theorems”) as in proving true theorems; this is especially important if one 
wants to establish a protocol that can mechanically produce conjectures and 
prove or disprove them without further human intervention. 



2 Geometric Theorems and Algebraic Proofs 

2.1 Proof Protocols 

An algebraic proof protocol consists in a series of steps that, starting from an 
informally stated theorem, produce a formal theorem with a proof or a disproof, 
completing correctly the hypothesis with additional conditions that were implicit 
or overlooked in the initial statement. Some protocols moreover start from the 
theorem hypothesis only, and try to discover some possible thesis that, together 
with some additional conditions as above, produces a true (and non-trivial) 
theorem with a proof. 

The aim of an automatic proof protocol is to have as much as possible of 
these steps to be carried out without human intervention. This might be espe- 
cially difficult for the translation from the initial informal statement expressed 
in natural language into a formalized statement; for this an expert system might 
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be conceived, but seems presently beyond reach. Recognizing and re-translating 
into geometric form the additional conditions needed is also difficult, especially 
if an automatic translation is required, but can sometimes be done, 

A proof protocol may consist, for example, in the following phases: 

1. Formalization: give a formalized expression of the hypothesis and the thesis 
as conjunctions of relations between geometric entities (points, lines, conics, 
etc.). 

2. Formalization through points: translate into a form involving only points, 
(with the possible addition of auxiliary points). 

3. Algebraic translation: translate into a form involving polynomial equa- 
tions between the coordinates of the points. 

4. Test phase: prove that the theorem can be completed with algebraic side 
conditions in the form of inequations yielding a true theorem. 

5. Algebraic completion: find inequations yielding a true theorem. 

6. Geometric completion: find geometric conditions {side conditions) to add 
to the hypothesis to ensure the truth of the theorem. 

7. Formal proof: this can be either an algebraic proof (algebraic manipulation 
of the coordinates) or an human-readable geometric proof, or something in 
between. 

As an example of an application of such a protocol, consider the theorem: 

Theorem 1. The bisectors of the angles of a triangle meet in the center of the 
in-circle. 

This is first translated into a statement like: 

Theorem 2. Let A,B,C be three points, let AB, BC,CA be lines connecting 
them, let a (resp. b,c) be a line through A (resp. B,C) and a point equidistant 
from AB and CA (resp. AB and BC, BC and CA); let D be a point on a and 
b; then D lies on c and the circle with center D and tangent to AB is tangent 
to BC and CA. 

Remark here that we have avoided to quote the line, the point, etc. to avoid 
implicit uniqueness assumptions: the language in which we have to specify these 
translations does not allow this form of statements. 

The theorem as expressed now is false for several reasons: it does not rule 
out degeneracies (like the three points A, B,C being aligned) but is also false if 
two bisectors are internal and one is external: this is due to a first translation 
that is not clever (we cannot expect an automatic translation to be clever; a 
clever translation requires a clear view of the geometric situation, and might be 
available only to somebody — man or machine — that knows the proof). 

The theorem is now translated, eliminating any reference to lines and circles, 
into a theorem on points only of the type: 

Theorem 3. LetA,B,C be points. Pi (resp. P2,P3) be a point equidistant from 
the lines through A,B and A,C (resp. A, B and B,C, A,C and B,C); then 
Pi,P2, P3 coincide, call it D; let moreover Qi (resp. Q2, Q3) be a point on a line 
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passing through D and perpendicular to a line passing through A, B (resp. A, C, 
B,C). Then the distances DQi, DQ 2 , DQ^ coincide. 

(This theorem too, being a translation of the previous theorem, is of course 
logically false.) 

The theorem hypothesis and thesis are now translated into sets of equations 
between the coordinates of the points A, B,C, Pi, P 2 , P 3 , D,Qi,Q 2 ,Q 3 , and one 
has to prove that a component of the hypothesis variety defined by the ideal of 
the set of hypothesis equations is contained in the thesis variety. This is the test 
phase. 

We have then to find algebraic side conditions, and to discover that the initial 
translation was incomplete: one has to avoid that A,B,C are aligned and that 
c is perpendicular to the line through C and Pi; hence the correct statement is 

Theorem 4. Let A,B,C be three points, not aligned, let AB , BC,C A he lines 
connecting them, let a (resp. b) he lines through A (resp. B) and a point equidis- 
tant from AB and CA (resp. AB, BC ); let a and b meet in a point D; then D 
is equidistant from AC, BC, and the circle with center D and tangent to AB is 
tangent to BC and CA. 

This is, however, not yet an accurate translation of the initial statement, 
since a conditions implicit in it (that the bisectors are internal to the triangle) 
has not been taken into account; this hypothesis however cannot be formalized 
algebraically, since it needs an ordering on the ground field, and the four tri- 
tangent circles are in the same algebraic component anyway. We have proved 
a slightly less general theorem than that we intended. In other cases, however, 
we may end with a completely different theorem than the one that was initially 
stated; sometimes this might end up with a proof of the truth of a theorem that 
in its initial statement is intuitively false (we’ll give an example later). 

The test phase is the one that is usually considered the phase providing 
the proof of the truth of the theorem. Several paradoxes arise however from 
considering true a theorem that has passed the test phase. The test phase usually 
consists in proving that the affine algebraic variety defined by the polynomials 
in which the hypothesis has been translated (the hypothesis variety) has a non- 
degenerate (algebraic) component on which the theorem is true (an alternative 
being to require that the theorem is true on every non-degenerate variety); the 
definition of non-degeneracy is a part of the protocol. 

We will not formalize further the notion of proof protocol; and we will con- 
centrate mainly on the test phase. 



2.2 Configuration Equational Theorems 

We consider a restricted class of theorems in Euclidean geometry that we call 
configuration equational theorems. These theorems have the form: 
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where hi,kj are algebraic relations, i.e. relations between geometric entities 
(points, lines, circles, . . . ) that can be translated into the vanishing of one or 
more polynomials in the corresponding coordinates. 

It is usual to represent all the relations as relations between points. The 
hypothesis and the thesis of a theorem of this kind are hence the conjunction 
of equations between the coordinates of a set of indeterminate points. In our 
examples we will use only relations of this type, but it is possible to handle 
other geometric entities through their coordinates; for example, handle lines in 
space through their Grassman coordinates. In this case of course the Grassman 
equations have to be considered. 

Some of the relations considered between points (the ones that we use in our 
examples) are the following (we use the notation := to denote that the right 
hand side is a definition of the left hand side, and the coordinates of a point 
denoted by an uppercase letter X are denoted by the corresponding lowercase 
letters {x\,X 2 ))- 

— X,Y,Z are aligned: 

/ xi yi zi \ 

col{X, Y, Z) := det X 2 j /2 -^2 = 0 

Vi 1 1 ; 

— Z is the midpoint of (X, T): 

Z = mid{X, Y) := {xi + yi = 2zi,X2 + y 2 = 2z2) 

— {X,Y) and {Z,T) are equidistant: 

XY = ^:={xi- yif + (^2 - V2? = {zi - hf + (z2 - fa)" 

— The lines through (X,Y) and {Z,T) are parallel: 

XY//ZT ■= det =0 

As a first definition of degeneracy, we consider a component of the hypothesis 
variety non-degenerate if it is contained in the thesis variety; hence the test phase 
has to test that a component of the hypothesis variety exists that is contained 
in the thesis variety. An algebraic proof performs computations in the ideals 
defined by the hypothesis and thesis conditions to prove this fact. A stricter 
definition of non-degeneracy is considered in the construction protocol, that will 
be defined later. 

In the paper p], in these proceedings, five different “levels of truth” are 
considered; this classification can be seen as a combination of two criteria: con- 
sidering a theorem true if it is verified on every (on at least one) non-degenerate 
component; and defining every component non-degenerate, or use the definition 
of non-degeneracy of the construction protocol. His fifth level of truth (rarely 
true, when the set of points where the theorem is true is not empty) does not 
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match our definition of truth, and every theorem that can be translated into 
homogeneous algebraic equations in the coordinates is at least rarely true (being 
verified at least when all the coordinates coincide). 

After the test phase, one has to identify (algebraic) additional conditions (in 
the form of inequations, (j){X) ^ 0, where (f) is & polynomial in the coordinates) 
that ensure that the thesis is verified; these are called side conditions, and usually 
one aims at a geometric description of these conditions, e.g. in the form of a 
statement of the form are not aligned”. 

To obtain a degeneracy condition, it is sufficient to find a polynomial van- 
ishing on those components that are not contained in the thesis locus. This 
polynomial is of course not unique, and expressing it in an optimal way is the 
key of the success of the further steps of the protocol. Hence the logically true 
theorem that is obtained at the end is not unique. 



2.3 Construction Protocols 

A construction in dimension d is given by 

— a sequence V of indeterminate points (Pi, . . . , Pn), each one being a d-tuple 
of indeterminate coordinates; 

— for every i, 1 < i < n, a, set Ai^i,Ai^rn,i of geometrical conditions on the 
points Pi, Pi with the restriction that for every i, the corresponding 
polynomials in the coordinates of the Pi are algebraically independent on 
the field generated by the coordinates of Pi, ... , Pi-i- 

The case d = 2 is usual, but everything can be carried over unchanged to any d. 

The construction protocol requires us to express the hypothesis of a theo- 
rem in the form of a construction, and gives a corresponding definition of non- 
degeneracy. 

If P = (Pi, . . . , Pn) is a construction, let Pi = {Pi , . . . , Pi) be a sub-construc- 
tion; let P be the hypothesis variety, and Pi the corresponding hypothesis va- 
rieties for the sub-constructions Pi; a component P of P is non-degenerate if 
for every i the natural projection PL ^ Pi induced by the natural projection 
(Pi, . . . , Pn) — >■ (Pi , . . . ,Pi) is generically surjective on a component of Pi. 

The construction protocol is often specified asking us to identify a set of 
free variables; our definition is more intrinsic and invariant under change of 
coordinates; moreover one can include a rule to choose the free variables once 
the coordinates are fixed — for example choose the first such coordinates that 
are algebraically independent. This reduces the part on which human expertise 
is needed to a more geometric part, leaving the algebraic specification to the 
automatic phase. When free variables are chosen, one can define non-degeneracy 
by a projection on the free variables (testing that the variables are parameters 
in a component); this is clearly equivalent to our formulation. 

A construction theorem that is true on a non-degenerate component is often 
called generically true, on the ground that the construction is possible on the 
generic point of the parameter space; in our opinion the term is misleading, since 
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it cannot be applied outside of construction theorems, when there is no a-priori 
definition of the parameters (not even of the number of parameters, since the 
intuitively non-degenerate components may have different dimensions). And the 
non-degeneracy of a component may depend on the choice of the construction 
(we will show examples of both later). 

Transforming an informally stated theorem into a construction theorem is 
often hard, and sometimes impossible; in any case, human expertise is needed; 
see, for example, Wu’s ingenious specification for the Morley trisector theorem 
(0, p. 39; we remark, moreover, that the construction given there does not match 
the definition of construction given in the same book); we quote moreover from 
0, p. 21 (immediately after having shown how to prove a false theorem): 

“The experienced reader might immediately find that the second con- 
struction of the configuration is absurd. 

Our prover requires the user’s responsibility not to introduce “absurd” 
constructions similar to above. Though for the above absurd construction 
and more hidden ones, some heuristics are built into our prover to detect 
them and give the user warnings, we do not have a general definition of 
all “absurd” constructions. We require the user to take full responsibility. 

To specify construction sequence of a geometric statement of construc- 
tive type is actually to specify the exact meaning of the statement. Thus 
the above requirement of the user’s responsibility by our prover is rea- 
sonable, because the user should understand the meaning of the geometry 
statement before starting to prove if (our emphasis). 

We will show later that transforming an hypothesis into a construction might 
sometimes hide useful theorems. 

3 Paradoxes 

Paradox: a statement or proposition seemingly self-contradictory or ab- 
surd, and yet explicable as expressing a truth. m 

For example, Zeno’s paradox shows that if space is continuous, time should 
be continuous too; and Russel’s paradox explains that extensional definitions of 
sets should be bounded. 

The definition of “truth” of a geometric theorem (truth on a component) 
may have paradoxical consequences: it may happen that 

1. p ^ q is true, p r is true but p ^ q Ar is false; 

2. p g is true, g => r is true but p r is false; 

3. p g is true but p A r => g is false. 

The possibility of such paradoxes is obvious; for example in the first paradox 
it may happen that p is true on a component on which g is true and r is false 
and conversely, but g and r may be generically incompatible (compatible on a 
proper subvariety of the hypothesis variety). Nevertheless, this possibility is often 
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overlooked: for example, some protocols consider only theorems whose thesis can 
be expressed by a unique equation; but this is not always the case: for example, 
to express a thesis that a point is the midpoint of two other points, more than 
one condition is required, and testing the conditions separately is insufficient in 
view of the first paradox. 

The paradoxes as usual vanish with a proper formalization. When we state 
that “p q is true” our notation is ambiguous; if “is true” means “can be 
derived by a system of axioms” then we can use the modus ponens inference 
rule; if “is true” means “passes an algebraic test (on a component)” we cannot. 
The paradoxes are ruled out if one defines truth as truth on every non-degenerate 
component, but this throws away the possibility of proving several interesting 
theorems, like the above quoted Morley trisector theorem. 

A simple example of the third paradox is the following: given 4 points in 
the plane, Pi,P2,P3,P4, that constitute a parallelogram, (i.e. {P1P2/ /P3P4) A 
{P1P3/ / P2P4)) then the two diagonals meet in a point that is the midpoint of 
both diagonals. If we add the condition that Pi, P 2 , P 3 are aligned, the theorem 
becomes false, since the diagonals coincide, and a point where they meet can be 
any point on them. This example however does not correspond to a construction. 

For a construction theorem we require that the thesis is verified for a non- 
degenerate component. Hence a theorem may be true, but not true when con- 
sidered as a construction theorem; and the truth of a construction theorem may 
depend on the ordering of the set of points. 

Consider the construction (Pi, P2, P3, P4), with the conditions 



— col(Pi, P2, P3), P1P2 — P1P3; 

- P4 = mid{P 2 ,P 3 )- 

Both P4 = Pi and P4 = P2 are true, but (P4 = Pi) A (P4 = P2) is false. This 
happens whenever there is more than one component that is non-degenerate 
with respect to a construction. 

Consider now (Pi, P3, P2, P4) with the same conditions; this is still a con- 
struction, but now the component Pi = P3 is degenerate; hence the theorem 
P4 = Pi is false. Hence the truth of a construction theorem may depend on the 
ordering of the points. 

Consider a continuation of the construction with P^, Pq, Py, and the condi- 
tions: 



^ P1P5 — P4P5 
^ PiPq = P4P6 
- TW7 = W 

We have now two components, one with Pi = P4 and the triangle P^, Pq, P7 
non-degenerate, and the other with P2 = P4 and the triangle P 5 , Pq, P7 degener- 
ate; both components are hence degenerate or non-degenerate depending on the 
ordering. Moreover the component with Pi = P4 (degenerate if the constructions 
starts with Pi, P3, P2) supports the following useful theorem: 
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Theorem 5. Let P^,Pe,P'j he points in the plane, and let P\ he another point. 
// P 4 is a point in the plane having from P^,Pq,P'j the same distances as Pi 
then P 4 = Pi (unless P^,Pq,P 7 are aligned). 

A further simple reasoning gives for this construction (Pi, , Pf) a paradox 
of the second type: let q be the hypothesis given by the construction, let p be 
q A (Pi = P4), and let r be col{Pi,, Pq, P7); then p ^ q and q ^ r are true, but 
p r is false. 



3.1 Real Theorems vs. Complex Theorems 

There are several differences between theorems in complex Euclidean geometry 
and theorems in real Euclidean geometry. Some are evident: 

— theorems in real Euclidean geometry that use inequalities do not make sense 
on the complex field; 

— existence theorems true on the complex field might be false on the reals: e.g., 
“given three segments, a triangle exists having sides of the same lengths as 
the three segments”. 

It is less evident that there are purely equational theorems such that the truth 
on the real field does not imply the truth on the complex field, and conversely. 
Hence a purely algebraic proof can neither prove nor disprove a real theorem, 
unless we have additional results. 

In particular, the existence of purely equational theorems true over the com- 
plex field and false over the real field is a paradox: the hypothesis does not have 
quantifiers, all the variables are free, and adding the reality condition is hence 
simply adding an hypothesis — a paradox of the type p ^ qis true but pAr g 
is false, where r is the hypothesis that the free variables take real values. 

3.2 A Simple Example 

We give simple examples of theorems true on the real field and false on the 
complex field and conversely. 

Consider the following construction of 5 points: 

1. P3 is the midpoint of Pi and P2; 

2. P4 is the midpoint of Pi and P3; 

3. P5 is equidistant from Pi and P2, and has from them distance equal to the 
distance of Pi from P4. 

The thesis that Pi = P2 is true on the real field (and false on the complex 
field, it is true only on a degenerate component) since if Pi and P5 are different, 
then taking Pi,Ps as unit distance then Pi,P2,Ps constitute a triangle with 
sides of lengths 4,1,1, and this is impossible on the real field (but possible on the 
complex field) . 
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Continue the construction above with one more points as follows: 

1. Pe is equidistant from Pi and P2- 

The thesis that P3, P5, Pe are aligned is true on the complex field: P3, P5, Pq 
lie on the axis of Pi, P2; but it is false on the real field since the only possibility 
is that Pi and P2 coincide, hence the two last conditions are empty. 

The hypothesis variety on the reals has only one component, that corresponds 
to a degenerate component of the complex variety. 

The complex construction, while on the reals, should not be considered a 
construction, since conditions involving P5 imply a condition on Pi,P2; this 
implication is however not equationally algebraic, depending on an inequality 
(1 + 1 < 4 ); but a possible definition of real construction built on this consider- 
ation is quite difficult to verify. 

3.3 A Weird Example 

We now consider a configuration that is not a construction, and that shows that 
the decision on degenerate configurations is much harder: depending on the thesis 
one would choose a different component as non-degenerate. The configuration 
also shows an essential difference between the real and complex cases. 

This theorem only requires collinearity and parallelism, hence it is an affine 
configuration; it is also possible to build a more complex configuration with the 
same properties that only uses collinearity, i.e. with a projective configuration. 
Consider 8 points Pi, . . . , Ps with the following condition (called 83): 

Pi,Pi+i,Pi+3 are collinear (where P^+g = P*). 

The condition 83 is not a construction, and no construction can contain an 
83 condition, since every point is involved in three alignment conditions. 

A theorem of Maclane states that if the eight points are real and distinct, 
then they are collinear. 

On the complex field, the set of configurations has 10 components: 

— A component Hm of dimension 10 , composed of 8 aligned points. 

— A component Hq of dimension 8, in which the points Pi, P3, P5, P7 are arbi- 
trary, and for every generic choice of them one has two possible configurations 
for P2,P4,P6,Pg; if Pi,P3,P5,P7 are real, the two choices of P2,P4,P6,Pg 
are complex conjugate; the real points of Hg constitute 8 components iJgy 
of codimension 7 , composed of configurations in which 5 points coincide. 

— Eight components 1 < i < 8 of dimension 8, Hi is composed of config- 
urations in which P*+i = Pj+3, P*+2 = P^+7, Pi+5 = Pi+e, Pi+i, Pi+2, Pi+4 
and Pi+5 are aligned. Pi is arbitrary. 

See e.g. pang. In any case for a real configuration the points Pi, P3, P5, P7 never 
constitute a proper parallelogram. 

The theorem stating that Pi , . . . , Pg are collinear is true on the reals (the 
exceptional locus consists of the components Hi, Hg+, 1 < i < 8 in which the 8 
points are not distinct). 
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The theorem is intuitively false on the complex field, since the component Hg 
cannot be considered “degenerate” . But if we adhere to our definition of (alge- 
braic) truth, the theorem should be considered true: an example of an obviously 
false theorem that is true. 

Consider now points Pi, . . . Pg such that Pi, . . . Pg satisfy the 83 condition, 
Pi,P3,P5,P7 build a parallelogram, and Pg is collinear with Pi,Ps and Pz,Pt. 
The only possible real configuration has 9 aligned points (this can be checked 
considering all the possible configurations of the real theorem, see m)- 

The thesis that Pg is the midpoint of Pi and P5 is true on the complex field 
and false on the real field: the complex component that corresponds to a proper 
parallelogram has no real point (more precisely, its real points are contained in 
another component that is not contained in the thesis locus). 

4 Partial Conclusions 

Here are some of the partial conclusions that we want to draw at this point: 

— Testing a theorem is only a part of the way to the statement of a true 
theorem; the theorem that will be eventually proved may be unexpectedly 
different from the initial informal statement: we might end proving something 
that is so different from the intuitive meaning of the starting formulation 
that what we prove, although formally true, could be described as a “false 
theorem” . 

— Constructions are adequate sometimes, but not always; they are no guarantee 
that a real theorem can be derived from a complex theorem. 

— The result of the testing of a theorem depends from the protocol; not every 
protocol is suitable for every theorem. 

5 Proving Real Theorems: The Real Radical 

We have seen that testing a theorem is insufficient to determine a logically true 
theorem; in the rest of the paper however we will concentrate on the analysis of 
some methods of algebraic proof, and on the extension of these methods to the 
real ground field (or more generically to ordered fields). 

5.1 Algebraic Proof Tools 

We have seen that in an algebraic proof we have to show that a component of 
the hypothesis variety is contained in the thesis variety. 

Basic tools to identify components are transporter and saturation (see e.g. 

PUSj). 

Let I, J be ideals of a noetherian ring R; the transporter I : J and the 
saturation I :* J are defined as follows: I : J = {a £ R \ aJ C I}; I ■* J 
denotes the (constant) ideal I : J" for n sufficiently large. See [Z] for an account 
on algorithms for the computation of / : J and I * J . 
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Algebraically, I : J enlarges the primary components corresponding to associ- 
ated primes containing J, and I ■.* J deletes such components; if the ground field 
is algebraically closed, and i? is a polynomial ring, the associated components 
of the radical correspond to the irreducible components of the corresponding 
variety, hence computing I : J if J is radical, or computing I :* J in general 
corresponds to killing the components contained in the locus of J. 

To prove that a theorem is true, let H be the hypothesis ideal, and T the 
thesis ideal; one of the two following computations is sufficient. 

— Let H' be the radical of H. Prove that H' ■. T ^ H' . 

— Prove that H :* {H :* T) ^ (1). 

Indeed, let H be the locus of H and T be the locus of T ; the locus of H' : T 
is equal to the closure of "H \ T, and the locus of H :* {H :* T) is the closure of 
TL\{TL\T), hence coincides with the union of the components of TL contained 
in T- 

The first computation is much harder, but can be generalized more easily. 

When we are over the real field, we no longer have the correspondence be- 
tween associated prime ideals of the radical and irreducible components, hence 
the methods are not sufficient. 

The first method however can be straightforwardly generalized to the real 
case taking the real radical instead of the radical: the problem is that the com- 
putation of the real radical is much harder and less known. Eventually, in Section 
lb. hi we will generalize the second method too. 

Let K be an ordered field, and R its real closure. The real radical of an ideal 
I C K[X] is the largest ideal Vl that has the same zero set as / in i?. 

With this definition, the first criterion above can be generalized to the real 
field: 

— Let H' be the real radical of H. Prove that H' : T ^ H' . 



5.2 Semi-ideals, Theorems with Inequalities 

The notion of real radical can be generalized to semi-ideals, that generalize the 
concept of ideal taking into account inequalities. 

A sign condition is a predicate p#0 where p is a polynomial and # is one of 
>) >) <1 =) A sign condition is strict if # is >, < or 

A semi-ideal is (I, cti . . . , cr„), where I is an ideal and the at are sign con- 
ditions. A semi-ideal defines a locus (a semialgebraic set) and one defines its 
radical as the largest ideal J such that ( J, cti . . . , (t„) has the same locus. 

The considerations of semi-ideals allows us to introduce inequalities in the 
hypothesis of theorems; we do not investigate here the straightforward general- 
ization (with some extra difficulties one can introduce inequalities in the thesis 
too). 

Semi-ideals are also needed in the algorithms for the real radical, even in the 
case that that we only need to compute the real radical of an ideal. 
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5.3 Computation of the Real Radical: An Outline 

The real radical can be defined non-constructively using the theorem 

Theorem 6 (Real Nullstellensatz). Let I he an ideal o/R[A], and f G R[A] 
vanishing identically on the real locus of I; then 3m, n £ N, fi, . ■ ■ , fm G R[A] 
such that /^” + Si ^ 

The converse implication is of course true. This theorem, however, is non- 
constructive, as opposed to the Hilbert Nullstellensatz, which implicitly gives an 
algorithm to test radical membership; the algorithms for the computation of the 
radical cannot be mimicked to construct the real radical or to test real radical 
membership. One of the difficulties is the following: while the radical commutes 
with separable extensions (if J = '/l C K[X] and K' is a separable extension of 
K then J ® K' = ^/I ® K') this is not true for the real radical: the real radical 
of {x^ — 2) is {x^ — 2) or cc — \/2 depending on the existence of \/2 G K. 

Algorithms for the computation of the real radical have been discovered only 
recently; most of the algorithms in the literature are theoretical, and unfeasible 
even for very simple examples, see |^, and HHEO : the main concern of these 
paper is just to prove complexity bounds. See however also j2|, that describes a 
feasible algorithm for a special class of ideals. 

An algorithm that is more attentive to practical complexity than to theo- 
retical complexity is described in m see also mm for an important sub- 
algorithm with a slightly different approach. We outline the algorithm of H3|, 
which, following the classical algorithms of Morse theory , tries to minimize 
the use of operations that, while innocent-looking from the theoretical complex- 
ity point of view, may make unfeasible a computation that would be otherwise 
feasible. 

The computation of the real radical is much harder than the computation 
of the radical; as opposed to the computation of the radical, which requires an 
equidimensional decomposition, it requires an irreducible decomposition; the key 
ingredient is the following theorem: 

Theorem 7. Let V he a complex algebraic variety, defined and irreducihle over 
the real field. If V has a real point P that is non-singular as a complex point 
then the set of real points is Zariski dense in V , hence the defining ideal of V 
coincides with its real radical. 

The proof of the theorem is simple and classical: for the implicit function 
theorem the fixed locus of the complex involution at the point P G V has real 
dimension equal to the complex dimension of V ; hence the set of real points is 
Zariski-dense. 

A common feature of most algorithms for the real radical is the following: 
decompose a semialgebraic set into subsets for which the defining ideal is easy 
to compute; then take the intersection of these ideals. 

The basic idea of our algorithm is the following: decompose V into compo- 
nents Vi irreducible over the ground field, (more explicitly, find a prime decom- 
position of the defining ideal) and let Si be the corresponding singular locus 
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(algebraically defined: the locus where the rank of the Jacobin ideal of a set of 
generators is not maximal); if V) \ Si is empty, replace V) with an irreducible 
decomposition of Si. At the end we get a decomposition of the real locus of V 
into irreducible algebraic varieties whose defining ideal coincides with its real 
radical. The real radical of the original ideal is the intersection of these radicals. 

To prove emptiness of Vi \ Si it is sufficient to prove the emptiness of Vi\S'i, 
where S’' is any subvariety of V of codimension 1 containing 5^, since if the real 
locus is contained in a 1-codimensional subset then by Theorem 0 it is contained 
in the singular locus. 

Vi \ S'i is defined by a semi-ideal. The proof thus reduces to a test of emptiness 
for semi-ideals; this can be split in two parts: 

1. Prove that the locus of the semi-ideal is compact; 

2. Prove that a function has no critical point (no point where the differential 

vanishes) . 

Since the locus of the critical points is recursively of smaller dimension (unless 
the variety has a very special position, to be handled separately) the emptiness 
can be handled recursively, reducing to dimension 0, where one can use methods 
for real root counting, see e.g. |24liVj . 

The function can be any polynomial; we take a coordinate function, that 
has the advantage of being linear, hence does not increase the complexity of 
the calculations; if the semi-ideal is an ideal (no inequalities), a more classical 
alternative (see mm and also m) takes a non-negative function (a sum of 
squares) thus avoiding the need of proving compactness. With classical methods 
(Rabinowitz trick) one can transform inequalities into equalities, but at the cost 
of adding a variable and a non-linear equation, hence threatening feasibility. 

To prove the compactness of a semialgebraic set S defined by a semi-ideal 
I — (/, CTi . . . , cr„), one can prove that it is closed in the projective space, and 
for this it is sufficient to prove that it is closed in every affine chart. 

Given a semi-ideal I = (/, tJi . . . , cr„), define Ij = (/, cti . . . , cr„, aj), where 
cTj is defined as follows: let e be a positive infinitesimal, i.e. a new indeterminate 
added to the ground ordered field K, positive and smaller than any positive 
element of K; then: 

— if CTj is gj > 0 then aj is gj = e; 

— if CTj is gj < 0 then aj is gj = — e; 

— if aj is gj ^ 0 then bj is gj = ±e; 

— in every other case bj is 1 = 0, i.e. Ij has empty locus. 

One proves that if every Ij defines an empty semialgebraic set, then S is 
closed, and if S is compact then every Ij has empty locus. Hence closedness is 
recursively reduced to emptiness. This proof is the key step of the theorem; since 
it mixes topological arguments with non-Archimedean arguments (use of the 
infinitesimals), the Tarski-Seidenberg quantifier elimination theorem is largely 
used; but this is only for the proof, no quantifier elimination is needed for the 
algorithm, hence the efficiency is not impaired. 



98 



Pasqualina Conti and Carlo Traverso 



Two remarks on combinatorial complexity: 

— Every gj ^ 0 gives rise to a gj = ±e, that can be reduced either to g"^ = 
or to two disjoint cases gj = e and gj = —e. The best way is the second, 
although it appears of exponential complexity: the two cases gj = e and 
gj = — e are algebraically identical, the difference being only in the real root 
counting phase, that can anyway be handled simultaneously for all the signs 
of the added e. Adding a quadratic equation instead doubles the degree, 
increasing the complexity of all the subsequent steps. 

— There is an element of exponential growth in the recursive handling of the 
affine charts; however the computations are anyway exponential in the num- 
ber of variables in average case complexity, and even doubly exponential in 
worst case complexity, and the recursion reduces the number of variables 
cutting with a hyperplane, hence the most time-consuming parts of the al- 
gorithm are the top-dimensional ones. 

Moreover, in general, for evaluating the practical performance of an algorithm 
requiring computations with polynomial ideals, one has to remember that the 
limiting factor is usually space, not time, and computing a very large number of 
small problems may succeed, while a single slightly larger problem may fail for 
space problems. 

A full description of the algorithm is in H3| 

5.4 A Simpler Algorithm for the Proof of Real Theorems 

To (algebraically) prove a theorem it is not necessary to compute the real radical: 
we do not need to identify all the components of the real locus, we only need to 
prove that a component is contained in the locus of the thesis. As soon as this 
is achieved, we can exit the algorithm. 

It is hence possible to avoid some of the more difficult points of the algorithm, 
in particular to avoid finding an irreducible decomposition (an equidimensional 
decomposition is sufficient, and is anyway needed for the computation of the 
radical). And we do not need to investigate smaller dimensional components 
when we have identified one component with smooth real points. 

The algorithm strictly follows the real radical algorithm, using unchanged 
some of its sub-algorithms, and simplifying it at some points. We leave the 
details to a further paper. 

5.5 A Criterion to Pass from a Complex Proof to a Real Proof 

From the algorithms for the computation of the real radical, in particular The- 
orem 0 it is clear that the following criterion holds: 

Theorem 8. Let T-L be the hypothesis variety, and T the thesis variety of an 
algebraieally true eomplex theorem. Let H be the Zariski closure ofH\T- Assume 
that P is a smooth real point of the Zariski closure of'H\'H. Then the theorem 
is algebraically true on the real field. 
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Indeed P is contained in a component of 'H that is contained in T ; this 
component, having a smooth real point, is a component of the real radical. 
Hence the real theorem is algebraically true. 

We remark that it is not sufficient to prove that P is smooth in TL and 
contained in T, since it may happen that P is a smooth point of a component 
on which the thesis is not generically true; nor it is sufficient to find a point 
of a component of the complex variety where the theorem is true, since it may 
happen that this complex component has a real locus fully contained in the 
intersection with another component having real points on which the theorem is 
false. 

A simple example for both remarks is the following: let H = (xz, y^z + = 

(x, + z^) n(z), T = (x). We have ^ (x, y^ + z^) = (x, y, z), \/(i) = (z) hence 

= (z), hence the complex theorem is true and the real theorem is false; but 
(0, 1, 0) is a smooth point where the thesis is true, and (0, 0, 0) is a real point of 
(x,y^ + that is the component of the hypothesis variety where the theorem 
is true. 

To check the hypothesis of Theorem 0 one has to find a defining ideal of 
7i\'H; one such ideal in the algebraically closed case is H :* {H :* T). This proof 
criterion can hence be seen as a generalization of the second proof criterion in 
Section lb. IL 

Usually, it is fairly easy to provide a real point P G TL if the theorem is true 
over the real field: it is just an example of the theorem. Checking the smoothness 
is just a rank computation of the Jacobian matrix at the chosen point. We have 
moreover to check that P is a point of a complex component contained in P ; this 
corresponds to checking that the maximal ideal corresponding to P contains the 
ideal H :* {H :* T). 

If in every component on which the thesis holds we do not have a smooth 
real point, the truth of the theorem on the real field remains undecided: it may 
happen that a real component of I contained in the singular locus is contained 
in the thesis locus. The real theorem will be true if and only if this component 
is not contained in a larger real component of H . Here are two examples: 

— for the configuration of Id. 21 all the real configurations are contained in the 
singular locus (since otherwise they would be dense, but we have seen that 
for them we have Pi = P 2 , that is not verified for the generic complex point). 
The thesis that Pi,P 2 ,Pa are aligned is true without exceptions; 

— the configuration of is an example of the other possibility, since the 
real part of the component of the complex variety satisfying the thesis is 
contained in another component, composed of 9 aligned points, for which 
the thesis is not verified. 

To decide the issue when a smooth point is not available (either it does 
not exist or we have been unable to find it) we can proceed with the following 
(probabilistic) algorithm: 

— compute I = H :* (H :* T) (that is an ideal defining TL\TL); 
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— find g G H :* T, g ^ H, i.e. a polynomial vanishing on "H but not identically 
vanishing on 'H; 

— consider the semi-ideal I = {H,g ^ 0) and prove that the locus of I is 
non-empty. 

This proves the existence of a real point P not contained in any component 
where the theorem is generically false; hence in every component containing P 
the theorem is true; but this neither provides an example nor proves that an 
algebraically smooth example exists; it is however sufficient to test the theorem. 

The algorithm can be made deterministic by repeating it for g va. a, set {gi\ 
of generators of H :* T. In that case if every {H,gi ^ 0) is empty we have a 
proof of the falseness of the theorem. 

To summarize, the test method outlined in this section can be viewed as a 
generalization of the second criterion in Section o after proving that the ideal 
I = H :* {H :* T) is not trivial, to test the theorem on the real field one can 
either prove that a smooth real point exists, or prove that a real point that is 
not a point of H :* T exists. In the test of a complex theorem, the second part 
is unnecessary because of Hilbert’s Nullstellensatz. 

6 A Protocol Proposal: Theorems with Examples 

The real proof criterion of Theorem 0 suggests a slightly different specification 
for configurational theorems; consider an ideal H generated by a set of configura- 
tional geometric equations, and let 7^ be a configuration satisfying the conditions 
(i.e. in the locus Ti of H), an example. If a point V is smooth, (i.e. the local- 
ization of C\X]/P[ at P is a regular local ring) then a unique component of the 
hypothesis variety contains P , and this component is declared non-degenerate. 
More generally, we may give a set Vi of examples, and consider non-degenerate 
any component containing one of the smooth examples. The test points may also 
be given approximately, provided that they are far from the singular locus and 
near to an unique component: this can be tested numerically. 

We remark that giving a smooth point allows us to find a set of parameters: 
a choice of coordinates exists such that the projection at the smooth point is 
a surjective diffeomorphism, and we can take these coordinates as independent 
variables, the other being dependent (on the component under consideration). 
This generalizes the usual construction protocol, but only on an a posteriori ba- 
sis: the existence of the independent variables is a conclusion, not an hypothesis; 
and if several test points are given, the set of independent variables may vary 
for different components (they may even be of different cardinality — the 83 
configuration is an example). 

Theorems in which a unique smooth example of the hypothesis is given are 
unambiguous, and all the paradoxes discussed above are solved. The praxis of 
exemplifying theorems with a drawing is moreover current in Euclidean geom- 
etry; a drawing is a smooth example, since (and if. . . ) it is approximate: the 
points can be slightly moved provided that the geometric relations indicated by 
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the lines and circles included in the drawing are maintained; this movement re- 
alizes the representation of a Euclidean neighborhood of the example, showing 
the smoothness. We remark that this protocol is already implicitly implemented 
in the system Cinderella m when you draw a construction this corresponds to 
an example; the deductions made by the system correspond to finding a thesis 
that is valid on the component of the construction defined by the example, since 
the movements used to find the random examples on which the hypothesis is 
tested avoid the singular locus, hence remain on the same algebraic component. 



6.1 The Real Case: Further Extensions 

In a theorem with example, if the complex theorem is true, and the example 
point is real, the real theorem is true too. 

In the real case it may be possible, following these ideas, to develop proof pro- 
tocols for theorem with examples that require inequalities in the side conditions 
(hence are algebraically false). 

Consider for example the following theorem: 

Theorem 9. In a triangle, the radius of the in-circle is not greater than half 
the radius of the circumcircle. 

On the complex field the theorem cannot be stated algebraically since it in- 
volves inequalities; if one allows complex conjugation the theorem can be stated, 
but is false since one cannot tell the difference between the four circles that are 
tangent to the three sides of a triangle (it is impossible to distinguish between 
the inside and the outside of the triangle, hence from in-circle and ex-circle). 

The theorem is however true on the reals. In that case, the interior of the 
triangle can be identified (it is the set of points where the functions defining 
the lines of each side have the same sign as the opposite vertex). Hence the 
inscribed circle is identified as the circle tangent to the three sides that has the 
center inside the triangle. 

In the set of (non-degenerate) configurations of the variety of tritangent cir- 
cles, the inscribed circles are not an algebraic component; but indeed they are 
a connected component: the degenerate triangles constitute a subvariety of the 
non-degenerate component, and are non-smooth points (belonging to two dif- 
ferent components, the non-degenerate and the degenerate). The set of config- 
urations corresponding to inscribed circles is a connected component of the set 
of smooth points (to pass continuously from an in-circle to an ex-circle one has 
to pass through a degenerate triangle; and conversely one can pass continu- 
ously from one non-degenerate triangle to another). An algebraic proof (indeed, 
a semialgebraic proof. . . ) is possible: if one considers the semialgebraic set of 
smooth points of the configuration, the in-circle and the ex-circles lie in differ- 
ent connected components, since the degenerate triangles constitute a different 
algebraic component, (one for which infinitely many circles tangent to the three 
coinciding sides exist), and one can pass continuously from an in-circle to an 
ex-circle only through a degenerate triangle or a complex triangle. The example 
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indicates the correct connected component, that is defined from the properties 
that a linear function vanishing on any two vertices has the same sign on the 
third vertex and on the center of the circle (i.e. the center of the circle is internal 
to the triangle) . 

Consider the following “theorem”: 

Theorem 10. In a triangle, the radius of one of the ex-circles is smaller than 
the radius of the circumcircle. 

An automatic semialgebraic proof correctly and optimally completing the 
hypothesis with a set of inequalities from a true example remains a challenge 
(the theorem is true e.g. if one angle is obtuse, for the ex-circle opposed to the 
smaller angle, and is false in a neighborhood of a regular triangle). 
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The intelligent reader elearly understands 
that the work is artificial in its essence. 

(Anonymons) 



Abstract. The mathematical literature related to automatic methods 
for proving theorems in Euclidean geometry is immense. However, it is 
the opinion of the authors that the theory behind this topic wonld profit 
from more algebraic tools and more methods from commutative algebra. 
The scope of this paper is to begin to fill such a gap. In particular we 
bring to the forefront important notions such as computing field, optimal 
hypothesis ideal, and good set of conditions. 



1 Introduction 

As is well explained in the survey paper [6], the dream of deducing all the 
theorems of a specific sector of mathematics from a small set of axioms has 
fascinated many mathematicians not least of which David Hilbert. The dream 
did not come true due to the fundamental work of Kurt Gddel, from which it 
became clear that it is not possible to construct a machine which proves all 
possible theorems. However, the door was left open to the possibility of using a 
machine to prove some theorems. 

The idea of constructing such a machine was revived by the appearance 
of electronic computers, and the goal of proving theorems automatically soon 
became a central part of that academic discipline, with the awe-inspiring name 
of Artificial Intelligence. 

Already in the early sixties, Gelernter, Hansen and Loveland in [5] introduced 
a “Geometry Machine” with the purpose of proving theorems in Euclidean ge- 
ometry. A major breakthrough in the subject was the work of Wen-tsiin Wu 
(see for instance [15-17]) and his coworkers of the Ghinese school. An impressive 
amount of material on the subject was collected by Shang-Ghing Ghou in the 
book [3]. 

Another source of inspiration was the advent of Grobner basis theory, due 
to Bruno Buchberger (see for instance [1,2]), and which inspired many funda- 
mental contributions, such as those of Deepak Kapur (see [7] and the important 
paper [8]), Bernhard Kutzler and Sabine Stifter [10], and more recently Tomas 
Recio with his coworkers [12, 11] and Dongming Wang (see for instance [14]). 



J. Richter-Gebert and D. Wang (Eds.): ADG 2000, LNAI 2061, pp. 104-128, 2001. 
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In this brief overview of the history we have clearly skipped over many im- 
portant paths and omitted many authors. Fortunately, a huge list of papers on 
the subject is contained in the web site managed by Dongming Wang at 

http : / / calf or . Iip6 . f r/"^wang/ GRBib/Welcome . html 

This reference is of fundamental importance for the adventurous reader who 
wants to navigate inside the huge quantity of work done on the topic of proving 
theorems automatically. 

The purpose of this paper is to contribute to the subject in many ways. First 
of all, we want to look at the fundamentals of the theory on a purely algebraic 
basis, where ideal theory provides the necessary devices. Along the way, we 
introduce important notions such as computing field, optimal hypothesis ideal, 
and good set of conditions. 

We also provide an implementation of our method in the system CoCbA 
(see [4]), which uses Grobner bases. But, since we mainly concentrate on the rep- 
resentation of knowledge, the tools for proving (Grobner bases, Wu-Ritt bases or 
other devices) are considered to be of lesser importance. Good sources of inspi- 
ration for us were the papers [8, 11], where an algebro-geometric approach was 
taken via the celebrated Nullstellensatz of Hilbert. 

Before giving a more detailed description of the content of the paper at the 
end of the introduction, we want to supply some background to help appreciate 
the novelties of our approach. 

We recall that the main step in proving a theorem automatically is to con- 
struct an explicit representation of its hypothesis and thesis. This should be done 
in such a way that the thesis follows from the hypothesis with the help of a set 
of rules, and both the representation and the rules can be manipulated by a 
computer. 

Given a theorem in Euclidean geometry, the fundamental steps usually con- 
sidered are: 

a) introduce Gartesian coordinates in the Euclidean plane or space; 

b) translate the hypotheses and thesis into algebraic relations among the funda- 
mental geometric data such as coordinates of points and lengths of segments; 

c) assuming that these algebraic relations are expressed as the vanishing of 
suitable polynomials, prove the theorem by showing that the “thesis poly- 
nomial” is a consequence of the “hypothesis polynomials” . 

All of these steps require some clarification in order to avoid critical obstacles. 
But before raising the many questions related to the concept of automatically 
proving theorems, let us say that even the traditional methods are not totally 
reliable, as the following beautiful example shows. We learned of and borrowed 
it from the book [3]. 

Example 1 (A Fake Theorem). Every triangle is isosceles. 

Proof. Let ABC be a triangle as shown in the picture. We want to prove that 
CA = CB. Let D be the intersection of the perpendicular bisector of AB and the 
internal bisector of angle ACB. Let DE T AC and DF T CB. It is easy to see 
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that CDE ^ Z\ CDF and Z\ ADE ^ BDF. Hence CE + EA = CF + FB, 
i.e. CA = CB. 



c 




What is wrong with this proof? If you feel a little nervous about this example, 
or you are simply surprised by this apparent paradox, please relax and you will 
discover the trick. Mathematics is full of difficulties and also the validity of many 
theorems relies more on general consensus than on the logical correctness of the 
steps of a formal proof. It is not our intention to delve into these philosophical 
oddities, we merely want to show that the idea of automating proofs leads to a 
fair number of new and subtle problems. 

To begin with, we say that it is practically almost impossible to automatize 
the entire process. For instance the choice of the coordinates and some clever 
simplifications of the input data are generally part of a preprocessing, which 
comes before the automatic part is left to the computer. 

Now, to highlight some of the problems we are going to encounter, we discuss 
a number of examples. 

Example 2 (Heron’s Formula). In any given triangle the area s is given by = 
p{p— a){p—b){p— c) , where a, b, c are the lengths of the sides, and p = (a+6+c) /2. 




Can we deduce it automatically? The answer is yes and the idea is the following. 

We introduce an orthogonal system of coordinates, we use the possibility of 
freely choosing the system, and position the triangle as shown in the picture. 
Then we use Pythagoras’ Theorem to show that 6^ = (a — 
and we observe that 2s = ay. 
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We construct the ideal I = {b^ — {a — x)^ — 2s — ay) 

in the polynomial ring P = R[x, j/, a, 6, c, s], we look for a polynomial relation 
among the indeterminates a, b, c, s and we try to deduce it from the given set of 
relations. 

One approach is to compute I nlR[a, b, c, s]. It works and produces a principal 
ideal, whose generator can be rewritten as Heron’s Formula after the substitution 
p = {a + b + c)/2. 

Example 3 (Equal Cubes). Cubes of equal volume have equal sides. 

This absolutely trivial theorem is not provable by using a method indepen- 
dent of the base field, simply because it is false over the complex numbers. Unlike 
the case of Heron’s Formula, where we want to deduce an algebraic relation from 
others, we could content ourselves to check that the thesis polynomials vanish 
on the common zeros of the hypothesis polynomials. A first big difficulty arises. 
The Nullstellensatz translates this vanishing into an algebraic verification only 
in the case of algebraically closed fields. 

The thesis polynomial x — y does not follow from the hypothesis polynomial 
x^ — y^, in the sense that x — y ^ a/ (x^ — y^). Then we observe that x^ — y^ = 
(x — j/)(x^ + xy + y'^) = {x — y){x — aiy){x — a 2 y), where «i, 02 are the two 
conjugate complex roots of the univariate polynomial x^ -I- x -I- 1. 

This means that the hypothesis ideal has three complex components, two real 
components, but the hypothesis variety has only one real component, which is 
the thesis variety. Therefore the statement is true, and hence a theorem, over 
the reals but it should be considered false over the complex numbers, since, for 
instance 

2^ = (-1-H 

Example 4 (Bisecting Diagonals). The diagonals of a rectangle cross each other 
at their midpoint. 




In the picture the two diagonals of the rectangle ABCD are AC and DB, while 
O is their point of intersection. We have to show that AO = OC and DO = OB. 
We proceed as follows. We introduce an orthogonal system of coordinates as 
shown in the picture and we name the coordinates of the relevant points by 
putting 



A = (0,0), H=(xi,0), C'=(xi,X 2 ), T>=( 0 ,X 2 ), 0=(x3,X4) 
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In this way we have already expressed that ABCD is a rectangle. The only 
hypotheses still to be translated are that O belongs to the line AC and to the 
line BD too. From elementary considerations we get 

hi = X4X1 — X3X2 and /12 = 3^12:2 — a:2a;3 — a;ia;4 

So we produce two hypothesis polynomials, hi,h2, whose vanishing represents 
the two described conditions. Analogously we produce the thesis polynomials, 
ti,t2, whose vanishing represents the property that AO = OC and BO = OD. 
We get 



ti = x\ + x\ — 2xiXz — 2x2X4 t2 = x\ — x\ — 2 x\X3 + 2x2X4 

We have already seen that the Nullstellensatz translates the vanishing into an 
algebraic verification only in the case of algebraically closed fields. However a 
second important difficulty shows up. No matter over which field we work, the 
statement is false. 

Indeed, it is well known that many theorems from Euclidean geometry turn 
out to be false in the algebraic setting, simply because the algebraic translation 
of some condition can encode more cases. This certainly happens in the case of 
the bisecting diagonals, but we can exhibit an even simpler example. 

Suppose we want to prove the following apparently trivial theorem. 

Let r be a straight line in the Euclidean plane, A,B G r, and C a point in 
the plane. Then C is aligned with A, B, if and only ii C G r. 

We may try to prove the theorem in the following way. We let r be the x-axis 
and A(0, 0) the origin. Then we let B{x, 0) be a point on the x-axis and C{a, y) 
be a point on the plane. What is the condition which expresses the alignment 
of A, B, Cl A first look at the problem suggests that the answer is y = 0. 
But of course there is a more complete answer, namely xy = 0, which takes 
in consideration also the case A = B. Indeed in such a situation every point 
is obviously aligned with A and B. If we really want to consider only the case 
Afi^B, then we need to disregard the case x = 0. This can be done by saturating 
the hypothesis ideal I = (xy) with respect to x. The new hypothesis ideal is now 
(y), and since the thesis polynomial is clearly y, the wanted conclusion follows 
trivially. 

Looking from the point of view of algebra, the hypothesis ideal / = (xy) 
and the hypothesis ideal J = {y) are different ideals. And the consequence is 
dramatic. Namely the statement is true with the hypothesis ideal J, but false 
with I. 

Many authors have already pointed out that it is not always easy to make 
a full a priori analysis of the degenerate cases of a geometric construction (see 
Example 8). But even more important is the consideration that the radical of 
the hypothesis ideal need not be prime. It is perfectly legitimate to consider a 
statement of the type: “Let T be a triangle which is isosceles or right angled. 
Prove that . . .” In this case, even if we disregard degenerate components, the 
hypothesis ideal has at least two prime components. Following this approach, we 
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are naturally lead to consider statements which are algebraically true on some 
prime components of the radical of the hypothesis ideal (see Definition 5) . 

Now we have seen some of the problems, so let us have a closer look at the 
content of the sections. 

In Section 2, following for instance the approach of [8], we restrict ourselves 
to proving or disproving algebraically true statements. In this way we can stay 
within the realm of commutative algebra, since the Nullstellensatz allows us to 
formulate the definition of algebraically true statements in terms of the radical 
of the hypothesis ideal (see Definition 1). 

However, the ring on which we consider our data is a polynomial ring over 
the reals, and it is well-known that M is not a computable field. With the aid of 
the notion of field of definition we define a computing field for a given statement 
(see Definition 3). It will play a fundamental role in the theoretical aspect of our 
approach. A first result is that if AT is a computing field for a statement T, then 
the validity of T can be expressed completely in a polynomial ring over K (see 
Thereom 2). 

A careful study of the hypothesis ideal in the case of the intriguing Example 4, 
shows that some statements are true on more than one component. In fact, the 
final part of Section 2 is devoted to giving a graphical representation of the space 
of components of our hypothesis. This space is closely related to what geometers 
call realization spaces (see for instance [13]). 

More algebra is needed to lay down the theoretical basis, and Section 3, which 
is the heart of the paper, takes care of that. After recalling or proving some easy 
facts form ideal theory, we define the condition ideal (see Definition 6) and we 
prove Theorem 3 and Corollary 2, where all the facts about a statement in 
Euclidean geometry are expressed in purely algebraic terms. In particular we 
concentrate on the intrinsic notion of optimal hypothesis ideal (see Definition 8) 
and on that of almost good and good set of conditions (see Definition 11). These 
are central notions, and Proposition 2 describes a way of computing almost good 
and good sets of conditions. 

Then the final section treats the algorithmic part of the paper. It states and 
proves the main algorithm (see Theorem 4) and describes the CoCbA code which 
can be freely used. With the help of CoCoA we illustrate some examples which, 
in different respects, illustrate some of the subtleties encountered in the paper. 

2 Algebraically True Statements 

As a first step we follow a classical approach, and define the notion of an alge- 
braically true statement. 

Definition 1. Let T he a statement in Euclidean geometry, whose hypothe- 
ses and thesis are expressed by the vanishing of polynomials h\,...,hr,t in 
K[a;i, . . . , Xn] . We assume that the ideal {hi, . . . ,hr) is proper. Then it is called 
the hypothesis ideal o/T, and denoted by T{T). The polynomial t is called the 
thesis polynomial. We say that the statement is algebraically true iftG i/TffT). 
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Likewise, we say that the statement is algebraically false if t ^ y^TlfT). If there 
is no ambiguity, we simply say that a statement is true or false. 

The first simplification is that we are only going to discuss how to give 
automatic proofs of algebraically true statements. 

The next fundamental step, which is most of the times overlooked in the 
literature, is to be able to do all the field computations inside a field which is 
computable. What does it mean that a field is computable? Roughly speaking, it 
means that it is possible to store an element of the field in finitely many memory 
cells of the computer, that one can check in finitely many steps whether two such 
representations correspond to the same field element, and there are algorithms 
for performing the four basic operations +, — , x,-L. 

The problem is that K is not computable. However, we may use the notion 
of field of definition, which we recall briefly (see [9] Section 2.4 for more details). 

Definition 2. Let K be a field, P = K[x \, . . . , x„] a polynomial ring, and I an 
ideal in P. 

a) Let k Q K be a subfield. We say that I is defined over k if there exist elements 
in k[xi , . . . , x„] which generate I as a P-module. 

b) A subfield k C K is called a field of definition of I if I is defined over k and 
there exists no proper sub field k' C k such that I is defined over k' . 

Theorem 1. Let I be a non-zero ideal in K[x \, . . . , cc„]. 

a) There exists a unique field of definition of I. 

b) Given any term ordering a, let G be the corresponding reduced a-Grobner 
basis of I. Then the field of definition of I is the field generated over the 
prime field of K by the coefficients of the terms in the support of the vectors 
in G. 

Proof See [9], Section 2.4. 

Corollary 1. Let fi, . . . , fr be polynomials in the ring . . . , Xn] and let I 

be the ideal (/i, . . . , fr). Then the field of definition of I is a finite extension of 
Q, hence it is computable. 

Proof. The claim follows immediately from the theorem, since the prime field of 
K is Q, and a reduced Grobner basis contains finitely many polynomials. 

Definition 3. Let be given a statement T in Euclidean geometry, whose hypo- 
thesis and thesis polynomials h\, . . . ,hr,t lie inM.[xi, . . . , Xn] . A field K is called 
a computing field for T, if it is finitely generated over Q and contains the field 
of definition of the ideal (I(T),t) = {hi, . . . ,hr,t). It is a computable field. 
The ideal generated by {hi, . . . , hr) in K[xi, . . . , Xn] will be denoted by Ik{T) or 
simply I{T) or even I, if no confusion arises. 

Definition 4. Let R be a commutative ring, let I and J be ideals in R. Then 
the set 

I : J°° = (J / : J® = {r G i? I J* • r C / for some f G N} 

iGN 

is an ideal in R. It is called the saturation of I by J. 
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Now we need an extra piece of information. In the following, if itT C L is 
a field extension, and / is an ideal in K[xi, . . . ,x„], we denote by I® the ideal 
generated by / in L[a;i, . . . , x„]. 

Proposition 1. Let K C L he a field extension, I, J ideals in K[x\, . . . ,Xn]- 
Then: 

a) n K[xi, . . . , Xn] = I- In particular, we have 1 G P if and only if 1 G I. 

b) For I = we get 7® = . 

c) We have 7® : (J®)°° = (7 : J°°)®. 

d) If moreover K is perfect, then -\/7® = (-\/7)®. 

Proof. Claims a),b), c) are standard in commutative algebra and depend on 
the faithful flatness of the homomorphism K[xi, . . . , x„] = — > L[xi, . . . , a;„]. The 
proof of claim d) is based on its 0 -dimensional version, namely on the fact that 
if 7 is a 0-dimensional radical ideal and K is perfect, then 7® is also a radical 
ideal. The proof of this fact can be found in [9], Proposition 3.7.18. 

With the help the following example we illustrate the importance of the 
notion of the computing field. 

Example 5 (Golden Ratio, Golden Section, Golden Mean). 




Let AB be a segment as shown in the picture. Let CB be perpendicular to AB 
and such that the equality AB = 2CB holds true. Let D be the intersection of 
AC with the circle centered in C and passing through B. Let if be a point on 
AB such that AE = AD. 

We have to show that AB, AE satisfy the golden ratio, i.e. ^ . 

We introduce an orthogonal system of coordinates as shown in the picture. 
We get 

A = (0, 0), B = (xi, 0), C = (xi, y), D = (x 2 , X 3 ), E = (x 4 , 0) 

In this way we have already expressed that CB is perpendicular to AB and that 
AB = 2CB. It remains to express the facts that D belongs to the line AC, that 
CD = CB, and that AE = AD. We get the following hypothesis polynomials: 

hi = —-X 1 X 2 + X 1 X 3 /l 2 = — X^-|- 2 xiX 2 — X 2 -I-X 1 X 3 — X 3 ft -3 = — X 2 — X 3 -I-X 4 
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The thesis polynomial is 

t = -\/5a;4 — 2xi + CC4 

Therefore a computing field is Q(-\/5) and we may perform all the computations 
in Q(A/5)[a;i,a;2,a;3,X4]. 

An alternative way of posing the problem is to require that AE is the golden 
section of AB, i.e. that ^ The thesis polynomial is 

t = x\ — 2x\ X4 + x\x\ — x\ 

Therefore a computing field is Q and we may work in Q[a;i, X2, X3, X4]. We will 
see later (see Section 4) that the two approaches lead to different computations. 

In the following, we are only going to use the operations on ideals described 
above. This implies that all the theory and the computation can be carried on 
in the ring P = K[x\, . . . , x„], where if is a computing field for P. Since K is 
computable, the procedures which we are going to develop in the paper are true 
algorithms. 

For the moment, we content ourselves to get the first step toward the autom- 
atization of the proof of theorems in Euclidean geometry. 

Theorem 2. With the above assumptions, let K he a computing field for T. 
Then the following conditions are equivalent. 

a) Statement T is algebraically true. 

b) The thesis polynomial t is in ^ i/c(T) . 

Proof. Let us prove the only non trivial implication, i.e. a) b). Let P be alge- 
braically true. Then there exists d G N such that G {hi, ... , dr)M[a:i, . . . , x„]. 
But G K[xi , . . . , Xn], hence G Ik{P)^[xi, ■ . ■ , x„] fl K[xi , . . . , x„], and the 
latter is Ik{P) by Proposition l.a. We have checked that t G y/ 1 k{P), which 
concludes the proof. 

After the above discussion, we assume that for every given statement P we 
work over a fixed computing field K for P, and we call Ik{P) the hypothesis 
ideal ofP. 

Unfortunately also this simplification is not enough to avoid further diffi- 
culties. Let us go back to the Bisecting Diagonals statement. We saw that the 
hypothesis ideal is / = (0:4X1 — X3X2, X1X2 — X2X3 — X1X4). We take the thesis 
polynomials ti,t2 and check if they belong to '/l. The answer is no. Should 
we conclude that the Bisecting Diagonals statement is not a theorem? Are we 
facing the same difficulty as in the example of the equals cubes? 

Let us have a look at another example. 

Example 6 (Trisection of Line Segment). The two straight lines passing through 
a vertex of a square and the medium points of the opposite sides cut the opposite 
diagonal into three equal parts. 
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We introduce Cartesian coordinates, and place the square as in the picture. Then 
we have 



A={1,0), B={l,l), C={0,1), M = (1/2,0), N= (1,1/2) 

Furthermore we introduce a new indeterminate x, we work in the polynomial 
ring Q[/,a:], and we have 



P = (x,x), Q = (I — x,l — x) 

In this way we have already expressed that P and Q belong to the line OB, and 
that OP = BQ. The only hypotheses still to be expressed are that C, M,P are 
aligned and Q belongs to the line ON. Both lead to the polynomial h = P — 3lx, 
hence the hypothesis ideal is the principal ideal I = (h). The thesis comes from 
expressing that OP = PQ, hence t — P — 4lx + A computing field is Q. 
We check if t G \/l, the answer is no, and again we must conclude that the 
statement is algebraically false. 

And now we have to face the problem of discovering why this statement turns 
out to be algebraically false. In this case the answer is fairly easy. The ideal / 
is a principal ideal generated by P — 3lx = 1(1 — 3x). In other words it describes 
two different situations. One is given by / = 0, a component over which the 
statement is algebraically false, the other one is given by ^ — 3x = 0 and the 
thesis polynomial t = P — 4lx + 3x^ is such that t G (I — 3a;). 

We conclude that the statement is algebraically true (hence a theorem) if 
the square is not a point. It is also interesting to observe that this case is not 
a limit case of our construction, but a truly different component. This suggests 
the following 

Definition 5. Let T be a statement, K a computing field for T . Let L be the 
hypothesis ideal ofT in K[xi, . . . ,Xn], t its thesis polynomial, and let pi, . . . ,pc 
be the minimal primes of \/l, so that pi fl . . . fl pc is the primary decomposition 
of Vl- We say that T is algebraically true on pi iftGpi. 

Therefore a statement can be algebraically false, but true on a component. 
This is what happens in the case of the trisection of a line segment. 
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Let us consider again the case of the Bisecting Diagonals (see Example 4). 
In this case it is easy to see that I = vT = pi H p2 Ll p3 fi p4, where 

Pi = (ici - 2a;3, X2 - 2 x 4 ) p2 = (a;2,a;4) p3 = (a;i,a:3) p4 = (a;i,a;2) 

and ti,t2 £ pi np4, while ti,<2 do not belong to p2 and to P3. So the statement 
is algebraically true only on pi and P4. But what is the geometric interpretation 
of the hypotheses expressed by the generators of pi and p4? For instance, it is 
not so clear how to read pi as a set of hypotheses. It is instead clear how to read 
p2, p3, p4- The vanishing of p2 represents the degeneracy of the rectangle to the 
line segment AB. The vanishing of p3 represents the degeneracy of the rectangle 
to the line segment AB>. The vanishing of p4 represents the degeneracy of the 
rectangle to the point O. 

So now it is clear that the vanishing of pi represents the same hypotheses as 
before (i.e. that O belongs to the line AC and to the line BD) applied to non 
degenerate rectangles. This is fine. It looks like we can find good hypotheses and 
good conditions, which corresponds to the obvious remark that a finite subset of 
a finite set can be described as the complementary set of its complementary set. 

But something even more intriguing is emerging from this example. Namely 
the algebraic analysis leads to the conclusion that the statement is true (hence a 
theorem) for rectangles which do not degenerate to a line segment plus rectangles 
which degenerate to a point. Even more than in the case of the trisection of a 
line segment, something is really defeating the intuition, namely the fact that 
this strongly degenerate case is not in the limit of the other degenerate cases. 

Let us explain why, with the help of the next picture, which is of a different 
nature with respect to the other pictures in the paper. It represents the compo- 
nents of the hypothesis ideal above. It is a homogeneous ideal, hence the four 
components Pi = {xi~2x3, X2-2X4), p2 = (x2,X4), ps = (xi,X3), p4 = (xi,X2) 
represent lines in P^, which we call Li, L2, L3, L4 respectively. 




Clearly Li, L4 are skew, as well as L2, L3. The interpretation goes as follows. 

The line L\ represents the component of non-degenerate rectangles. The lines 
L2 and L3 represent the components of rectangles which degenerate to the side 
AB and AD respectively. The line L4 represents the component of rectangles 
which degenerate to the point A. The projective point (2 : 0 : 1 : 0) represents 
the rectangles which degenerate to the side AB, while O is a point of the plane 
(even outside AB), such that the ratio of the ^-coordinates of B and O is 2. 
Analogously, the projective point (0 : 2 : 0 : 1) represents the rectangles which 
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degenerate to the side AD, while O is a point of the plane, such that the ratio of 
the x-coordinates of D and O is 2. The two points (2 : 0 : 1 : 0) and (0 : 2 : 0 : 1) 
are the true limits of the degeneration of the rectangle to one of its sides. 

On L2, T3 the statement is false, but then the statement is again true on 
L4. It represents the rectangles which degenerate to the point A. Here O can be 
taken arbitrarily. We said that on L3, L4 the statement is false, but there are 
exceptions represented by the projective points (0 : 0 : 1 : 0), (0 : 0 : 0 : 1). 
Let us explain the first one. It represents rectangles degenerated to the point A, 
with O on the x-axis. Therefore it is a limit case of the rectangles represented 
by L2- Although in general the statement is false on L2, it is true on the limit, 
because we keep O on the x-axis. The same explanation holds for (0 : 0 : 0 : 1). 

This example shows that to investigate the situation we need more tools. 
They are developed in the next section, which is the heart of the paper. 

3 Optimal Hypothesis Ideals and Good Conditions 

We have seen that in general the radical of the hypothesis ideal is not prime, so 
it can be expressed as the intersection of several prime ideals. A way to proceed 
could be the following: compute '/l, then compute its prime decomposition, and 
finally check on which prime components D is algebraically true. 

This approach is impractical for two different reasons. The first reason is 
the complexity of computing the prime components of the radical of I. But even 
more important is the difficulty of understanding the geometric meaning of some 
components. After a heavy computation we might conclude that we have proved 
a theorem, but we do not know which. 

Let us go back for a moment to the Bisecting Diagonals, and suppose we 
decide to exclude a priori the degenerate cases xi = 0 and X2 = 0. Then our 
hypothesis ideal becomes 



I : (xiX2)°° = (xi - 2x3, X2 - 2x4) 

on which it is immediate to verify that the statement is true. We can safely say 
that the statement is verified under the condition that xi yf 0 and X2 yf 0. It 
means that it is verified providing the rectangle is truly a rectangle. With this 
condition it is a theorem. 

Another possibility which may happen is that t does not belong to any of 
the prime components of the hypothesis ideal. Then the statement should be 
considered to be absolutely false. 

To clarify these situations we need more technical devices from Commutative 
Algebra. 

Lemma 1. Let R be a noetherian ring, I an ideal in R, f,fi,---,fr^R\ {0}, 
and J = {fi, . . . , fr). Then 

a) I : = IRfHR 

b) I: joo ^ 
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Proof. The proof of these elementary facts can be found for instance in [9], 
Section 3.5. 

Lemma 2. Let R be a noetherian ring, I an ideal in R, and f,g G R\ {0}. 
Then the following conditions are equivalent: 

a) g e y/ I ■■ f°° 

b) f e y/I : g°° 

c) IGI: {fgr 

Proof. Let us prove a) c). Using g G ^yI : f°° we deduce that there exists a 
natural number d such that g‘^ G I : f°° , hence there exists a natural number e 
such that {gfY G Y which implies that 1 G / : {fgY. Conversely, if 1 G / : {fgYi 
then g^ G I : f°°, hence g G \// : /“. Condition c) is symmetric on / and g, 
therefore the proof is complete. 

Lemma 3. Let R be a noetherian ring, let L, J be ideals in R, and let g G R. 
Then 

a) V/ : J°° = y/l : J = y/l ■.'/j 

b) L : J°° = {Y if and only if J ^ Vl 

c) L : g°° C Vr if and only if g \ 0 mod \/l 

Proof. Let us prove the first equality of a). To show that V I : J°° Q Vl ' J, 
we pick an element a G Vl : J°°. There exists a natural number d such that 
a‘^ G I : J°° , hence there exists a natural number e such that (aJ)® G Yl. 
Therefore aJ C -/l, and we may conclude that a G Y1 : J. Conversely, let 
a G '/l : J. Then there exists a natural number e such that {aJY G I. Therefore 
a G Vl : J® C ■■ J°°. 

To prove the second equality, it suffices to show that : J C ^/I : -/j, 
since the other inclusion is obvious. For aJ C \/l there exists a natural number 
d such that {aJY Q I- We have to show that for b G VJ, then ab G Yl. Now, 
for b G VJ there exists a natural number e such that 6® G J, hence (o6®)'^ G I, 
hence ab G \/l. 

It is clear that b) follows from a), since L : J°° = (1) is equivalent to 

Yl ■ = (1). 

Finally we prove c). From / : g°° C yY we deduce that ^/TT~g^ C Vj, 
therefore vT : g = y/l, by a). Conversely, if g | 0 mod -\/7, then -\/j : g = \/l, 
which implies • 9°° = V^, by a). The conclusion is that / : g°° C \/j. 

Lemma 4. Let R be a noetherian ring, let I be an ideal in R, and let g G R. 
Assume that '/l = pi fl . . . fl ps fl qi fl . . . fl q* is a minimal prime decomposition 
of y/1 , where ^ G pi fl . . . fl ps, and g ^ q^ for j = 1, . . . ,t. Then 

a) ■ 9°° = qi n . . . n qt 

b) - {I ■■ 5°°)°° = pi n . . . n P, 

Proof. To show a), we recall from Lemma 3. a that y/I ■ 9°° = : 5, and the 

claim follows easily. Now let us prove b). We use again Lemma 3. a to see that 
'. {I '. g°°)°° = \/l : Y {I '■ 9°°)- The assumption and a) imply that we have 
to show the equality (pi fl . . . fl ps fi qi fl . . . n q*) : (qi fl . . . fi qt) = pi fl . . . fl ps, 
whose proof is immediate. 
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In the following we let hi, . . . ,hr,the polynomials in M[a:i , . . . , Xn] ■ Then we 
let T be a statement in Euclidean geometry, whose hypothesis ideal is the ideal 
in , Xn] generated by {hi, . . . , hr), and whose thesis polynomial is t. Let 

K he a, computing field for T and let I{T) be the ideal generated by hi,. .. ,hr 
in P = K[xi, . . . ,Xn]. 

Due to Proposition 1, all the operations described in the above lemmas can 
be considered in a polynomial ring over K. 

To say that t G ^/l{T) ■ f°° means that P is algebraically true under the 
condition / yf 0, as we can see from Lemma 1. Therefore Lemma 2 shows that 
VWh t°° is the container of all the conditions. We see the ideal /(T) : t°° 
becoming an important actor. It deserves a name. 

Definition 6. The ideal /(T) : is called the condition ideal o/T. It is de- 

noted by Ic{P) or simply by P if no confusion arises. 

Definition 7. Let J = {fi, . . . , fs) be an ideal in P. A statement T is called 
algebraically true under the conditions expressed by J (or algebraically true 
if at least one of the conditions fi yf 0, ... ,fs yf 0 holds true^, if the revised 
statement, whose hypothesis ideal is I{T) : and whose thesis polynomial 

is t, is algebraically true. A statement T is called absolutely false if the thesis 
polynomial does not belong to any minimal prime of the hypothesis ideal. 

Theorem 3. Let T be a statement in Euclidean geometry, whose hypothesis 
and thesis are expressed by the vanishing of polynomials hi, . . . ,hr,t. Let K be 
a computing field for T and let I{T) be the proper ideal generated by hi, ... ,hr 
in K[xi, . . . , Xn] . Then the following facts hold true. 

a) We have t G a//(T) : /c(T)°°. 

b) The following are equivalent. 

1 ) The statement T is algebraically true. 

2) Ir{T) = (1) 

3) ./wYTiAf)^ = ./im 

c) The following are equivalent. 

1 ) The statement T is absolutely false. 

2) ic{T) c /f(r) 

3) I{T) : Ic{T)°° = (1) 

d) If the statement T is neither algebraically true nor absolutely false, then we 
have the chain of strict inclusions \/l{T) C \/l{T) '. Ic{‘T)°° C (1). 

Proof. Claim a) is immediate from the definition. We prove Claim b). From 
Theorem 2 and Lemma 3.b we deduce the equivalence between 1) and 2). Since 
2) 3) is clear, it remains to show that 3) =k 1). This implication follows 

directly from a). 

Now we prove c). From Definition 7 and Lemma 3.c. we see the equivalence 
between 1) and 2). The equivalence between 2) and 3) is clear. 

Finally we prove d). From 3) of b) we get the first strict inclusion, from 3) 
of c) the second one. 
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The above theorem indicates that most of the information about the validity 
of T is stored in the ideal I{T) : Ic{‘T)°°- In particular we see that if T is 
neither algebraically true nor absolutely false, then what is algebraically true 
is the revised statement, whose hypothesis ideal is I{T) ■ Ic{T)°° , and thesis 
polynomial is t. But we can say more with the help of Lemma 4 . 

Definition 8. An ideal which contains the hypothesis ideal I, and whose prime 
components are exactly those on which T is algebraically true is called an optimal 
hypothesis ideal for T- 

Corollary 2. With the same assumptions as in Theorem 3 , an optimal hypothe- 
sis ideal for T is the ideal I{T) : Ic{T)^ ■ 

Proof. The proof follows immediately from Lemma 4 .b. 

This fact motivates the following definition 

Definition 9. The ideal I{T) : Ic{T)°° will be denoted by Iop{T). 

What happens if there are more than one thesis polynomials? Suppose that 
there are r > 1 thesis polynomials ti. In that case it is easy to see that the 
following equality Ic(T) = d(T) : (ti, ■ ■ ■ holds true. 

Remark 1 . With this terminology some parts of Theorem 3 can be rephrased in 
the following way. 

The statement T is algebraically true if and only if i//op(T) = a/ / (T) . 

The statement T is absolutely false if and only if lopi'T) = ( 1 ). 

This could be the final point of our investigation. However, if we do the 
computation in the case of the Bisecting Diagonals Theorem, we get that the 
optimal hypothesis ideal (/:(/: (ti, *2)°°)°°) is equal to 

{Xi — 2X1X3, X1X2 — 2X1X4, X2 — 2X2X4, X2X3 — X1X4) 

We are unable to read these hypotheses. As we had warned before, we have 
proved a theorem, but we do not know exactly which. Even if we are able to 
decompose this ideal, we get the two components pi = {x\ — 2x3, X2 — 2x4), 
p 4 = {xi,X2). As we observed, we understand p4, but it is not clear how to 
interpret pi. 

This fact suggests the idea that sometimes it could be better to describe the 
validity of a statement not by describing the components on which it is true, 
but by listing the components on which it is algebraically false. The advantage is 
that we can do that by throwing away hypersurfaces, i.e. by imposing conditions. 
The disadvantage can be, as it happens in the case of the Bisecting Diagonals 
Theorem when we put X\X2 yf 0 , that we throw away even a good component. 
This is life. 

In any event, it is better to further investigate conditions. 
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Remark 2. We recall that in the paper [11] a polynomial / is called an irrelevant 
condition for T, if (d, /) = (1). Now, either T is algebraically true or not. In 
the latter case Ic yf (1) by Theorem 3. a. If / G 7c is a condition, then we 
get (7,/) C {Ic,f) ^ Ic ^ (1), hence / is not an irrelevant condition. This 
observation implies that there is no need to consider irrelevant conditions. This 
observation was already stated in [11] (see Exercise 4.7). 

Inside the ideal IdT) we may encounter polynomials fi G \/l{‘T) and poly- 
nomials fj ^ a/ 7 (T) . A polynomial fi of the first type should be considered 
useless, since 7(T) : f°° = (1), so that if we consider the statement under the 
condition fi yf 0, there is no statement anymore. These conditions are called 
trivial in [11]. Let us formalize this notion. 

Definition 10. A condition f G Ic{T) is called trivial if f G i/Iild- 

We observe that if / is a non-trivial condition (hence T is not algebraically 
true), then \/l{T) : f°° = \/l{T) : f is a proper ideal which contains \/ 1{1^) 
strictly. Therefore the condition deletes some but not all of the prime components 
of id I possibly some good ones. A first simplification can be done in the 
following way. We decide to sacrifice the possibility of checking the validity of T 
on all the components, by imposing all the non-degeneracy conditions, which can 
be spotted immediately from the assumptions. For instance in Example 6 it is 
clear that we may assume Z yf 0. The observation suggests that we can substitute 
the hypothesis ideal 7 = {P — 3lx) with the new hypothesis ideal I : l°°, and this 
choice leads immediately to the conclusion that the statement is algebraically 
true. 

Now the next question is: can we find a good set S of conditions, i.e. a set 
S with good properties, and such that Iop(T) = I(T) : 7^(7")°° = 7(T) : 

We propose the following definition. 

Definition 11. Given a set of generators E of Ic{T), a subset S C E is called 
an almost good set of conditions for T if 

a) Iop{T) = I{T) : (^)- 

b) For every pair (f,g) of elements in S, such that I{T) : f°° yf 7(7") : g^, 

there is no inclusion relation between I{T) : and I{T) : 5 °°. 

It is called a good set of conditions for T if 

a) It is almost good. 

b) For every pair (f,g) of elements in S, I{T) : f°° yf 7(T) : g°° holds true. 

Proposition 2. With the same assumptions as in Theorem 3, let G be a set of 
generators of the ideal R. We define a partial ordering on G by putting a < b 
if I : a°° C I : b°° . We let S' be the subset of G of the minimal elements. Then 
we consider the equivalence relation on S' which is defined by putting a = b if 
I{T) : a°° = 7(T) : b°° . Let S be a set of representatives of the equivalence 
classes. 

a) The set S' is an almost good set of conditions for T. 
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h) The set S is a good set of conditions for T- 

c) IfT is not absolutely false, then S does not contain trivial conditions. 

Proof. The first two claims follow from the definitions, so let us prove the third. 
If T is not absolutely false, then Ic 2 by Theorem 3.C.2, hence there exists 
at least one non-trivial condition b among the elements in G. Let a be a trivial 
condition in G. Then / : a°° = (1), while / : 6°° C (1), so that a is discarded. 

Remark 3. It is possible to get better set of conditions, by imposing the following 
constraints 

a) Iop{T) = I{T) : (^)- 

b) n (/(T) : /f ) 2 I{T) : /r, for every z = 1, . . . , s. 

Such a set can be called an optimal set of conditions. However, the experience 
shows that to find optimal set of conditions is too demanding from the compu- 
tational point of view. 

Remark 4- While optimal hypothesis ideals depend only on the data of the prob- 
lem, good and optimal sets of conditions depend on the chosen set of generators 
of Ic{T), so that they are not intrinsic to the problem. Nevertheless good sets 
of conditions play an important role in the algorithms which we are going to 
describe in the next section. 

4 The Algorithms, the CbCbA. Code and Some Examples 

At this point we have the possibility of describing an algorithm, which largely 
automatizes the process of proving that a statement in Euclidean geometry is a 
theorem. The beginning of the process is a step which aims at optimizing the 
hypotheses. However, it may delete some degeneracy components on which the 
statement is true. 

Theorem 4. Let T be a statement in Euclidean geometry, whose hypothesis 
and thesis are expressed by the vanishing of polynomials hi, . . . ,hr,t. Let K 
be a computing field for T and let L be the ideal generated by hi,..., hr in 
P = K[xi , . . . , Xn] . Consider the following sequence of instructions. 

0) ( Optional preprocessing) Let g be the product of all the degeneracy conditions 
to be excluded a priori. Compute L : g°° , and call this ideal L again. 

1 ) Compute Lc . 

2) Lf 1 G Ic, then return ‘ 'The statement is algebraically true" and 
stop. 

3) Let G be a set of non- zero generators of Ic. 

4) Compute I : f°° for every f G G. If 1 G I : f°° for all f G G, then return 
' 'The statement is absolutely false" and stop. 

5) Consider the partial order on G given by a ^ b if I : a°° Cl : b°° , and let S' 
be the set of the minimal elements of G with respect to Then consider the 
equivalence relation on S' given by a = b if I : a°° = I : b°° , and let S C S' 
be a subset of representatives of the equivalence classes. 
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6) Compute J = I : 

1) Return J, S and stop. 

This is an algorithm which decides whether T is algebraically true or absolutely 
false. In the case that T is neither algebraically true nor absolutely false, the 
algorithm finds an optimal hypothesis ideal and a good set of conditions for T . 

Proof. The first instruction requires the computation of Ic. If /c = (1) then T is 
algebraically true by Theorem 3. a. If R is a proper ideal, then there exist non- 
constant polynomials fi,. . . , fs such that Ic = (/i, ■ ■ ■ , /«)• Let us consider the 
ideals / : ff°. We know that fi € Ic hence fi G '/If = /l ' t°° . Then Lemma 3 
implies that t G /I : ff°. Since we know that t ^ /I, we may conclude that 
/I : ff°, which is equal to /I : fi by Lemma 3. a, is strictly bigger than /I. 

If all the fi belong to /I, then Ic C /I and T is absolutely false by Theo- 
rem 3.b. 

It remains to show that if T is neither algebraically true nor absolutely false, 
then S' is a good set of conditions for T. This fact follows from Proposition 2. 

Remark 5. It is clear how to modify Step 5 to get optimal sets of conditions 
instead of good sets of conditions. It is also clear that Steps 5, 6 can be used 
interchangeably. If one deletes step 5, then Step 7 reads: Return J and stop. If 
one deletes step 6, then Step 7 reads: Return S and stop. 

Now we illustrate the CoCoA code, which transforms the algorithm described 
above into an executable code. We insist on the notion of good set of conditions, 
which is the most important from a computational point of view. 



Algorithm: Almost Good Set of Conditions 

Define AlmostGoodSetOf Conditions (L) 

AlmostGood := [] ; 

Foreach ElementOfL In L Do 
Inserted := FALSE; 

I := 1; 

While I <= Len(AlmostGood) And Not Inserted Do 
If AlmostGood [I] . Ideal = ElementOfL . Ideal Then 
Append (AlmostGood [I] .Polys, ElementOfL . Poly) ; 
Inserted := TRUE; 

ElsIf AlmostGood [I] . Ideal > ElementOfL. Ideal Then 
AlmostGood[I] := Record(Ideal=ElementOfL . Ideal , 

Polys= [ElementOfL . Poly] ) ; 

Inserted := TRUE; 

ElsIf AlmostGood [I] . Ideal < ElementOfL. Ideal Then 
Inserted := TRUE; 

Endlf ; 

I := I+l; 

EndWhile ; 

If Not Inserted Then 

Append (AlmostGood, Record(Ideal=ElementOfL . Ideal , 

Polys= [ElementOfL . Poly] ) ) ; 



Endlf ; 
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EndForeach; 

Return [X. Polys I X In AlmostGood] 
EndDefine; — AlmostGoodSetOf Conditions 



Algorithm: Proving 

Define Proving(Hypothesis , Thesis) 

Conditionsideal := Saturation(Hypothesis , Ideal (Thesis) ) ; 

If ConditionsIdeal=Ideal(l) Then 
Return Record (Statement=TRUE) ; 

Endlf ; 

Conditions := Gens (Conditionsideal) ; 

ConditionsIdealList := 

[ Record(Ideal=Saturation(Hypothesis , Ideal(C) ) , Poly=C) I 
C In Conditions] ; 

If [ X In ConditionsIdealList I X . IdealOIdeal(l) ] = [] Then 
Return Record(Statement=FALSE, Conditions= [] ) ; 

Endlf ; 

Return Record(Statement=FALSE, Conditions = 

AlmostGoodSetOf Conditions (ConditionsIdealList) ) ; 

EndDefine; — Proving 

or 

Return Record(Statement=FALSE, OptimalHypothesisIdeal = 

IntersectionList ( [I . Ideal I I In ConditionsIdealList])); 

Remark 6. It should be noticed that Algorithm Proving does not return the good 
set of conditions S, as described in Theorem 4, Step 5. It returns an almost good 
set of conditions S' . The transition from S' to S is done by human judgment, so 
it belongs to a phase which could be called postprocessing, as we shall see with 
the help of some examples. 



The first example that we are going to investigate is Example 5. 

Example 5 (continued). In the preprocessing phase we decide to exclude the 
possibility that the segment AB degenerates to a point. We do that by replacing 
the hypothesis ideal (/ii, / 12 , h^) with the new hypothesis ideal / = (/ii, / 12 , h^) : 
(xi)°°. Then we compute a set of conditions by using Algorithm Proving, and 
get 

[xa Xi — \/\Q\/bx\ — l/10\/5xi X4 + l/\Q\/bx\ — l/lQx\ — \/ 2 x\ X4 — \/\Qx\ , 
y/(>x\ X 4 + x\ — V^X 4 + 2 xf + X 1 X 4 — 3x1 X 4 + x\, 

•\/5xi — 2 'Jhx\ X4 + 'Jhx\ — X I + 2 xi X 4 + 4xi x| — 8 x 4 ] . 

All the conditions are equivalent. We conclude that a good set of conditions 
is 

{V 5 xl X4 + -\/ 5 xi X4 — ^fbx\ + 2xi + Xi X4 — 3 xi X4 + x| } 

We can factor the polynomial and get 

{V5x4 + 2xi — X4)(xi + 1/2x4 — ■\/5/2 x4)(xi + 1/2x4 + ■\/5/2 x4) 
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We conclude that the statement is algebraically true under the conditions that 

. The last condition excludes the situation where B is 
external to the segment AC. 

You remember that a proposed alternative way was to impose that AE is 
the golden section of AB. In this way we were able to use Q as the computing 
field. 

Then we compute a set of conditions by using Algorithm Proving, and get 

[xf + X1X4 — X4, 

X 3 — 2/5x1 — 1/5x4 , 

X2 — 4 / 5 xi —2/5x4] 

All the conditions are equivalent. We conclude that the statement is alge- 
braically true under one of these conditions. The last two are not clearly inter- 
pretable. However, if we work in Q[-\/ 5 ] [xi, X2, X3, X4], we factor x^ -|-Xi X4 — x| = 
(xi -I- 1 / 2 x 4 + \/5 x4)(xi -I- 1 /2 x4 — \/5 x4 ). We can conclude that the statement is 
algebraically true under the condition ^ . As before, the second 

condition excludes the situation where B is external to the segment AC. 

Example 1 (Isosceles or Right-angled Triangles) . Let be given a triangle which is 
isosceles or right-angled. Show that the center of the circumscribed circle belongs 
to one of the sides of the triangle. 




We introduce Cartesian coordinates, and place the triangle as in the picture. 
Then we have 



A =( 0 , 0 ), H=(xi, 0 ), C'=(X2,X3), D={x4,X5) 

DA = DB yields the polynomial hi = 2x4X4 — xf. 

DA = DC yields the polynomial /12 = x| -I- x§ — 2x2X4 — 2x3X5. 

We assume that the triangle is isosceles or right-angled in C. 

AC = CB or AC T CB yields the polynomial ft-3 = xfx2 — 8x^X3 -I- 2x4X2 — 
x\x\ 2x4X2X§. 
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The thesis D G AB yields the polynomial t = X1X5. Therefore a computing 
field is Q and we may work in the polynomial ring X2, X3, X4, x^]. 

We run Algorithm Proving without any preprocessing, and get the following 
almost good set of conditions 
[x^Xi - X3X2, X3 - X3X4 - 2x1x3], 

The statement is not a theorem. Using the equivalence relation introduced 
in Theorem 4, Step 5, we check that the two conditions found are equivalent. 
Therefore a good set of conditions is a single polynomial, and we can choose 
between X3X4 — X3X2 and X3 — X3X4 — 2x§X5. 

We enter the phase of the postprocessing. We observe that it is easier to 
interpret the first condition. It means that the statement is algebraically true 
under the conditions that X3 yf 0 and X4 yf X2- Clearly, X3 yf 0 excludes the 
degenerate case that the triangle degenerates to the segment AB, while X4 yf X2 
excludes the case that the triangle is isosceles. 

We run the second variant of the Algorithm Proving and get the Optimal 
Hypothesis Ideal 

(X2 + X3 — 2X2X4 — 2X3X3, x\ — 2 x\X 4 , X4X3) 

As it happens frequently, the optimal hypothesis ideal is difficult to read. In 
this case we can understand why. Namely, in this case we can compute 

V7 = pi n p 2 n p3 n p4 n p5 



where 

pi = {xi,X2 + x\ — 2X2X4 — 2X3X3) p2 = {X3, Xi — X2, 2x4 ~ Xx) 
p 3 = {.X3, X2, 2 X 4 - Xi) p4 = (x4 - X2 , - X4 - 2X3X3, 2 x 4 ~ Xi) 

P5 = {X3, x\ + x\ — 2 X 2 X 4 , 2X4 — Xi) 

It is easy to check that t G ps e t G pi, but t ^ p2, t ^ p3, < ^ p4. 

In conclusion the statement is true on the component which represents the 
right-angled triangles, and on the component which represents the triangles 
which degenerate to the segment AC. This is why the optimal hypothesis ideal 
is complicated. 

We could follow a different path by doing some preprocessing. Suppose we 
want to avoid the cases where the triangle degenerates to a segment. This means 
that the ideal I should be replaced by the ideal / : (a;ia;3)°° and t = X3. Let us 
do that. We compute and get 

I :(a;ia;3)°°= {xi—2x4, X2X3—X4X3, x\+x\— 2x2X4— 2x3X3, X^X3—X4X3—2X3X^) 

We run Algorithm Proving on this new ideal, and get the following set of 
almost good conditions [x2 — X4, x§ — X4 — 2 ai 3 a; 5 ]. 

Again we check that the two conditions are equivalent, we use the first and 
conclude that the statement is true unless the triangles are isosceles, hence it is 
true for right-angled triangles. The optimal hypothesis ideal turns out to be 

(X3, Xi — 2 X 4 , X2 + X3 — 2x2X4) 
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Now it should be clear that it expresses that the triangle is right-angled in C. 

Example 8 (Tangent and Secant). Let be given a circle C and an external point 
P. Through P consider a tangent line I and a secant line s to the circle. Then 
the square of the distance of P to the point of contact of I with C is equal to 
the product of its distances to the intersection points of s with C. 




We introduce Cartesian coordinates, and place the axes as in the picture. Then 
we have 

P = (0,0), A = {xi,0), B = {x 2,0), M={a,b), T = (x 3 ,y) 

TM = AM yields the polynomial hi = {x^ — a)^ + {y — b)^ — {x\ — a)^ — 6 ^. 
AM = BM yields the polynomial /12 = (a^i — a)^ — {x2 — a)^. 

PT T MT yields the polynomial /13 = X 3 {x 3 — a) -I- y{y — b). 

The thesis PT = PA ■ PB yields the polynomial x\ + y"^ — X\X2. 

Therefore a computing field is Q and we may work in the polynomial ring 
Q[a;i,a;2,a;3,i/,a, 6]. 

We run Algorithm Proving without any preprocessing, and get the following 
good set of conditions [xi — X2]. 

This comes somehow as a little surprise. It means that the statement is false 
on the component determined by the choice A = B, i.e. when we do not specify 
that A, B are the two points of intersections of the secant line with the circle. 

It is a typical situation where it is very easy to overlook a degeneracy condi- 
tion. 

Example 9 (Feet and Midpoints). In every triangle the circle passing through the 
feet of the three altitudes intersects the sides of the triangle in their midpoints. 

We introduce Cartesian coordinates, and place the geometric objects as in 
the picture. Then we have that A = (0,0), B = (a;i,0), C = (x2,X3) are the 
vertices of the triangle, D = (x2,0), E = (x4,X5), F = {xq,xy) are the feet of 
the altitudes. Finally let O = (xs,xg) be the center of the circle, M = (cci/2, 0) 
the medium point of AB. 

F € AC yields the polynomial hi = —x^xq + X2X7, 

AC T FB yields the polynomial /12 = a;ia:2 — X2Xq — X3X7, 

E G CB yields the polynomial ft-3 = —X1X3 + X3X4 + X1X5 — X2X5, 
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AE _L BC yields the polynomial ft.4 = — a;ia;4 + X2X4 + X3X5, 

OE = OD yields the polynomial = —x\ + x\ + x\ + 2x2Xg, — 2x4X% — 2 x 5 X 9 , 
OD = OE yields the polynomial he = X2 — Xq — x'^ — 2x2Xg + 2 xgX 8 + 2x7X9. 




The thesis OM = OD yields the polynomial t = l/Ax\ — x^ — XiXg + 2x2Xg. 
Therefore a computing field is Q and we may work in the polynomial ring 
Q[xi, X2, X3, X4, X5, xe, X7, xs, xg]. In the preprocessing phase we decide to ex- 
clude some degenerate cases. Namely we exclude the possibility that the triangle 
degenerates to a segment, by replacing the hypothesis ideal (ft-i, ft-2, ^3, ft-4, ft-5, ft-e) 
with the new hypothesis ideal I = {hi, h2, ft.3, ft.4, h^, Hq) : (xiX3)°°. 

Then we compute an almost good set of conditions by using Algorithm Prov- 
ing, and get 

[X5X7X8 — 2X5X6X9 + X4X7X9 — 2X7X8X9 + 2 xeXg, 

x|x7 -b X7 — 2X6X7X8 — 2X7X9 -b 4X6X7X8X9 — 4 x|xg, 

X1X5 -b X5X6 — X4X7 — 2X5X8 -b 2x7X8 — 2 xiXg -b 2x4Xg — 2 x 6 Xg, 

X3X5 -b 2X2X8 — 2X4X8 — 2X5X9, 

X3X6 - X4X7, 

X4X6 — Xg -b X5X7 — X7, 

X1X3 — X4X7 — 2X3X8 -b 2X7X8 — 2 xiXg -b 4X2X9 — 2X5X9, 

X2X4 — x| — X7 — 2 X 2 X 8 -b 2X5X8 -b 2X7X9, 

X3X4 — 2X5X5 — 2X3X8 -b 2X5X8 -b 4X2X9 — 2X4X9, 

X1X2 — X5 — X5X7 — X7 — 2X2X8 -b 2X5X8 -b 2X7X9, 

^ 2^5 ^ 5 ^ 6 ; 

X2X5 — x| -b X5X7 — X7, 

X3X7 — 2X5X7 — 2x2X8 -b 2X5X8 -b 2x7X9, 

X2X7 — X4X7, 

X5X5X7 - 2X5X6X9, 
x|x7 — 2X5X7X9, 

X4X5X7 — 2X5X6X9, 

X5X7 — 2X5X7X9, 

X4X7 — 2X7X8 — 2X4X7X9 -b 2X6X7X9 -b 4X7X8X9 — 4 x 6 x|, 

X5x| — I/2X5X7 — 1/2X7 ~ X^XqX^ -b X5X7X8 — X§Xg -b X^X-jXg, 
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X%Xq - 2x^XqXq, 

XiX^ — 2XyXs — 2 xiXiXq + 2xqX'^Xq + AxyXsXg — 4x63^9] 

All the conditions are equivalent. We conclude that a good set of conditions 
is {xgx^ — x^xg}, and that the statement is algebraically true under the condition 
that X 2 X 5 — X 5 XQ yf 0, which means that the triangle should not be right-angled. 
In this case the Optimal Hypothesis Ideal is difficult to read. 

Example 10 (Parallelogram). Given a parallelogram, the intersection point of 
the diagonals lies on a side. 




We introduce Cartesian coordinates, and place the geometric objects as in the 
picture. Then we have that A = (0,0), B = (xi,0), C = (x2,X3), D = {xi,x^) 

are the vertices of the parallelogram, and O = (xq^xt) is the intersection point 

of the diagonals. 

AB II DC yields the polynomial hi = —X1X3 + X1X5, 

AD II BC yields the polynomial /12 = —X3X4 — X1X5 + X2X5, 

O G AC yields the polynomial /13 = -x-^Xq + X2Xt, 

O G DB yields the polynomial ft .4 = —X1X5 + x^xq + X1X7 — X4X7. 

The thesis O G AB yields the polynomial t = X1X7. Therefore a computing 
field is Q and we may work in Q[a;i, X2, X3, X4, X5, xg, X7]. 

Then we compute an almost good set of conditions by using Algorithm Prov- 
ing, and get 

[xf — X4 — 2xiXq + 2x4Xq, 

X1X2 — XiXq — X2X3 + X4XQ, 
x\ — 2X2X3, 

X2X4 + XIX3 — X2X3 — X4X3] 

This is a case where, no matter which condition we take, its understanding 
is not trivial. However, we may compute the Optimal Hypothesis Ideal and get 
that the Optimal Hypothesis Ideal is the ideal 



(a;ia:5, X1X3, x^Xq - X4X7, X3X3 - X2X7, X3X4 - X2X3, X1X7) 



whose meaning is clear. We conclude that the statement is algebraically true for 
the parallelograms which degenerate into a segment. 

In this case the possibility of working with two options was useful for the 
correct interpretation of the output. 
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Abstract. Proof by refutation of a geometry theorem that is not uni- 
versally true produces a Grobner basis whose elements, called side poly- 
nomials, may be used to give inequations that can be added to the hy- 
potheses to give a valid theorem. We show that (in a certain sense) all 
possible subsidiary conditions are implied by those obtained from the 
basis; that what we call the kind of truth of the theorem may be derived 
from the basis; and that the side polynomials may be classified in a useful 
way. We analyse the relationship between side polynomials and kinds of 
truth, and we give a unified algorithmic treatment of side polynomials, 
with examples generated by an implementation. 



1 Algebraic Preliminaries 

Throughout, let n be a positive integer and L a fixed field containing the field 
Q of rational numbers. Let Q[A„] be the ring of polynomials with n- variable set 
Xn = {xi, X2, . . ., Xn} Over Q. 

Let F C Q[A„] and / G Q[A„]. We call {F, f) a possible theorem. {F)x„ 
denotes the ideal generated by F in Q[A„]. For a G L"^ we denote by /(a) the 
result of substituting a for their corresponding variables in / and evaluating. 
For F C Q[A„], let 

Cx^(F) = {/gQ[A„] I for all a GL"', h{a) = 0 for all h gF implies /(a) = 0}, 

an ideal of Q[A„] as is easily checked. If the choice of polynomial ring is clear we 
often write just C{F). The ideal Cx^F) depends on the field L. For example if 
n = 1 and F = {x^-bx}, then if L = i? is the field of real numbers, Cxi (F) = (x), 
because x^ -I- x has only one root x = 0 in i?. If L = (7, the field of complex 
numbers, then Cxi (F) = (x^ -b x), because x^ -b x has three roots x = 0, *, —i in 
C. If L is algebraically closed, then by Hilbert’s Nullstellensatz, Cx„{F) = {/ G 
Q[A„] I G (F)xA, the radical ideal generated by F in Q[A1„]. 

Dually, for F C Q[A„], let 

Vx„{F) = {a I a G L", /(a) = 0 for all / G F}, 
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the variety associated with F C Q[X„]. Again, we often write just V(F) if the 
context is clear, and if F = {/}, a singleton set, we shall often write Vx„{f) (or 
just V(/)) rather than Vx„ ({/})• Varieties are closed under arbitrary intersec- 
tions and finite unions. 

There is a familiar dual isomorphism between the lattices of varieties of the 
form Vx„(G) and ideals of the form Cxr^(G), G C Q[A„]. Because the lattice 
of varieties is distributive and satisfies the descending chain condition, every 
variety V has a unique decomposition 

V = Vi U V2 U • • • U 

into distinct irreducible components (varieties which cannot themselves be ex- 
pressed non-trivially as unions of two or more proper subvarieties). 



2 The Kind of Truth of a Possible Theorem 

In geometrical theorem proving, the higher level statement of a valid geometry 
theorem naturally translates into an equational implication involving polyno- 
mials, with geometrical predicates such as “points A, B and G are collinear” 
becoming polynomial equations via coordinatisation. Further, one can require 
that certain variables be treated as independent, in the sense that no algebraic 
relations are assumed to hold amongst them. Chou has argued in j2j that the 
specification of the independent variables in a geometry theorem is an integral 
part of the algebraic formulation: such variables are chosen according to a no- 
tional “construction” that takes place when the geometry theorem hypotheses 
are read in order. It is this approach we adopt. 

Formally, suppose the variables in {7 C are specified as being independent; 
view Q[U] as a subring of Q[A„]. We say that a variety V is U -generic (or 
simply generic if the choice of U is clear) if no polynomial in Q[f7] is zero on 
all of V. If L is algebraically closed, then Vx„(A') is 17-generic if and only if 
{F)x„ nQ[C7] = {0}; this is called U -independence in |^. Let Qx„{P) denote the 
union of all U -generic irreducible components of Vx„(A'). 

We can now define the four basic kinds of truth of a possible theorem: for 
F U {/} C Q[A„], we say the possible theorem {F, f) is 

1. universally true if / is zero on all of Vx^(F), that is, /(a) = 0 for all a € 

VxAF); 

2 . generically true if / is zero on Gx„{F); 

3. generically conditionally true if there exists an irreducible component of 
Gxn{F) on which / is zero but / is not zero on all of Gx^{F); 

4. generically false there is no irreducible component of Gx^{F) on which / is 
zero. 

We are further able to separate the final category into two subcategories. A 
generically false possible theorem is: 
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— degenerately true if there is no irreducible component of Gx„ (F) on which / 
is zero, yet there is at least one irreducible component of Vx^iF) on which 
/ is zero; 

— rarely true if there is no irreducible component of Vx^ {F) on which / is zero. 

We call these kinds of truth 1 to 4 respectively, with kind 4 split into 4(a) 
and 4(b) as above. Note that a possible theorem is of at least one of these five 
types, and exactly one of types 2 to 4 (b). We shall show how to determine the 
kind of truth of a possible theorem using Grobner bases for the case where L is 
algebraically closed. 

Generic truth reflects the idea that the conclusion holding on the generic 
irreducible components of the hypotheses is what is “really intended” by the 
author of a theorem. Generically conditional truth occurs when there is some 
ambiguity in the hypotheses of the theorem and the conclusion will not hold 
on all generic irreducible components; this is in practice a rare situation but 
can occur as illustrated in |2| and elsewhere. Generic falsity occurs when the 
conclusion is valid on no generic irreducible components. In contrast to rare 
truth, degenerate truth is at least a form of conditional truth, and the fact that 
it can be algorithmically distinguished from rare truth has led us to define it 
separately. 

Most of these kinds of truth have in essence been considered elsewhere along 
with algorithmic ways of determining the kind of truth of a possible theorem (see 
m and more recently m for instance) . However, so far as we know, no unified 
algorithmic treatment of the kind presented here, applying to all the kinds of 
truth discussed here and providing a complete set of side polynomials for each, 
has appeared previously. 



3 Side Polynomials and Kinds of Truth 

For the remainder of the article, F and / will be an arbitrary subset and an 
element of Q[X„] respectively. 

In 0, a method described by Kapur as refutational theorem proving was 
considered for proving geometry theorems translated into polynomial equations 
in this way. Kapur’s approach was based on the idea of considering the conjunc- 
tion of the hypotheses of the theorem with the negation of the conclusion (in a 
certain sense), forming a Grobner basis and determining if it was (l)jc„; if so, 
then the theorem was validated and if not the polynomials in the basis could be 
used to give side conditions in the form of inequations which could be added to 
the hypotheses in order to give a valid theorem. 

Such side conditions prove necessary because most standard Euclidean ge- 
ometry “theorems”, as normally stated, are not universally true, owing to the 
absence from the hypotheses of certain additional non-degeneracy conditions 
that may be represented algebraically as inequations. These may correspond to 
hypotheses of the form “points A, B and C are not collinear”, and so forth. Of- 
ten, such extra hypotheses are not easy to guess, as is discussed at length in 
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0, although synthetic proofs of geometry theorems make at least tacit use of 
them. Note that any finite number of inequations may be expressed as a single 
inequation. 

We show that all possible side conditions (in a certain natural sense) are 
implied by those obtained from the Grobner basis used in Kapur’s method, and 
that the kind of truth of the theorem may be derived from this basis; moreover, 
the side polynomials may be classified in a useful way. 

Formally, we say that g £ Q[-^n] is a side polynomial for (F, f) if /(a) = 0 
for all a £ V{F) for which g{a) ^ 0. Let the set of all side polynomials for (F, /) 
be denoted side{F,f). 

Theorem 1. For the possible theorem (F,f), C{side{F, f)) = {side{F, f))x„ = 
side{FJ). 

Proof. Now of course side{F, f) C {side{F, f))x^ C C{side{F, f)). Suppose 
h £ C{side{F, f)) and that h{a) ^ 0 for some a £ V{F). Then because h £ 
C{side{F, f)), by definition there exists g £ side{F, f) such that g{a) yf 0, so 
/(a) = 0. Hence h £ side{F, f) by definition, and so C{side{F, /)) C side{F, /). 
Hence the two sets are equal. □ 

For example, letting F = {x\X 2 } and / = xi, one element of side{F, f) is 
g = X2, since for c, d £ L, if cd = 0 and d 0, then necessarily c = 0. In other 
words, for all a £ L^, if a £ V(F) and g{a) yf 0, then /(a) = 0. In fact in this 
case, side{F,f) = {X2), the ideal generated by X2 in Q[X2]. 

There are various kinds of side polynomial, corresponding to the various kinds 
of truth as we shall show. Thus a side polynomial g for (F, /) is 

1. generic if 5 £ Q[C/]; 

2. generically resolving if 5 ^ Q[G] and {F,g) is not generically true; 

3. degenerate if 5 ^ Q[C^] and {F,g) is generically true; 

4. extraneous ii g ^ Q[^] and (F, g) is universally true. 

Theorem 2. The possible theorem (F, /) is 

1. universally true if and only if every polynomial is a side polynomial; 

2. generically true if and only if there exists a generic side polynomial; 

3. generically conditionally true if and only if there exist no generic side poly- 
nomials, but there does exist a generically resolving side polynomial; 

4- degenerately true if and only if there exist no generic or generically resolving 
side polynomials, but there does exist a degenerate side polynomial; 

5. rarely true if and only if all side polynomials are extraneous. 

Proof. Condition 1 is immediate, and condition 2 is proved in | 2 ]. 

Now suppose (F, /) has no generic or generically resolving side polynomials. 
If 5 £ side(F, f) then (F, 5) is generically true, so if h{a) = 0 for all d £ F 
and g{a) yf 0 then a ^ Q{F). Suppose h ^ Q[G] and d is a non-degenerate 
side polynomial for (F, /); hence fh is zero on V(F), so certainly fh is zero on 
fo„(F), so GxAF) C V(/d) = V(/)UV(d). Hence Gx^F) = [fo„(F) n V(/)] U 
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[Qx„{F) n V{h)]. But h is non-degenerate, so h is not zero on all of Gx„{F), so 
Gx„{F) g V{h), so Gx„{F) n V{h) C Gx„{F), and so Gx„{F) n V(/) is a finite 
non-empty union of irreducible components of Gx^iF): let V(G') be one of these 
irreducible components. Then V(G') is a generic irreducible component of V(i^) 
also, and because V(G') C Gxr,{F) n V(/) C V(/), it follows that / is zero on 
V(G'). Hence {F,f) is not generically false. 

Conversely, suppose (F, /) is not generically false. Now V(F) yf 0, so C{F) yf 
Q[X„]. Suppose / is zero on the generic irreducible component V(F') of V(F); 
then V(F') C V(/). If V(F) has only one irreducible component (namely itself), 
then V(F) = V(F') and then / is zero on V(F) and so any polynomial h ^ 
C{F) is a non-degenerate side polynomial for (F, /); these exist because G(F) yf 
Q[X„]. On the other hand, if V(F) has more than one irreducible component, 
let V{Fl) be the union of the irreducible components of V(F) other than V(F'). 
Let h € G(F)\G(F'); such a non-zero h exists since V(F') ^ V(F), so G(F) is 
not a subset of C{F'), and G(F) is non-empty. Further, if F(a) =0 yet h{a) yf 0 
then a G V(F)\V(F) C V(F') C V(/), so h G side{F,f). But h ^ C{F') and 
V(F') C Gxn(F), so h does not vanish on Gx„{F) and so the possible theorem 
(F, h) is not generically true. Hence h is a, non-degenerate side polynomial for 
(F, /). Thus every side polynomial for (F, /) is degenerate if and only if (F, /) 
is generically false. 

Let U' = 0. Then all irreducible components of V(F) are F'-generic, (F, g) is 
F'-generically true if and only if (F, g) is universally true, and a side polynomial 
is F^degenerate if and only if it is extraneous. Then (F, /) is rarely true if and 
only if (F, /) is F'-degenerately or rarely true if and only if every side polynomial 
for (F, /) is F'-degenerate, that is to say, extraneous. 

Finally, there is at least one generically resolving side polynomial for (F, /) 
yet no generic side polynomials for (F, /) if and only if (F, /) is not generically 
true and (by the above) neither degenerately nor rarely true, that is, if and only 
if (F, /) is generically conditionally true. □ 



4 Proof by Refutation and the Kind of Truth 

Because of Hilbert’s Nullstellensatz, algebraically closed fields are algorithmically 
convenient to work with, and we frequently assume algebraic closure of L in 
what follows; moreover all fields have characteristic zero since we work with 
polynomials over the rational numbers. When L is the field of real numbers, a 
geometry theorem being true in any of the senses just defined means that it 
is true in the theory of Euclidean geometry. Of course, if a possible theorem is 
universally true over an algebraically closed field of characteristic zero, then it 
is true over the complex numbers and hence over the reals also. Although the 
converse fails, it seems to do so rarely, a fact which apparently generalises to 
the other kinds of truth as we see in examples to follow. Certainly any side 
polynomial over algebraically closed F is a side polynomial over the reals also. 
Thus the assumption that L is algebraically closed is not totally artificial and 
corresponds to a certain well-defined level of geometrical reasoning which in 
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practice seems only a little weaker than full Euclidean geometry, namely metric 
geometry. We recommend the book j2] to the reader interested in a more detailed 
account of some of these matters, which have been discussed by many authors. 

There are methods which allow one to test whether a given guess is a side 
polynomial for a possible theorem (and these are considered in detail in 0), 
but guessing side polynomials is generally difficult. Furthermore, the existence 
of (say) a resolving side polynomial for (F, /) does not preclude the existence of 
generic side polynomials, so the kind of truth is not necessarily established by 
a correctly guessed side polynomial. Moreover, one can never be sure of having 
a complete set of side polynomials (so that the disjunction of the associated 
inequations covers all possibilities for side conditions) using such an approach. 
A method which is able both to produce a complete set of side polynomials and 
then to read off the kind of truth of the possible theorem is desirable. It turns 
out that the set obtained using Kapur’s method in based on constructing 
the Grobner basis of F U {fz — 1}, does this job. 

Recall from TheoremEthat C{side{F, /)) = side{F, /). We shall call any set 
of polynomials G C Q[A„] for which C{G) = side{F, f) a complete set of side 
polynomials for (F, /). Then certainly, for any a G F", h{a) = 0 for all h G F 
and gi{a) yf OW g 2 {a) yf OV- • -V gk{a) yf 0 imply /(a) = 0, and moreover any side 
polynomial p is such that p{a) yf 0 implies 51(a) yf 0V52(a) ^ OV ■ ■ - V gk{a) yf 0. 
So the disjunction of the side conditions of the form 5^ yf 0 is the weakest possible 
such disjunction. We similarly define a complete set of generic side polynomials 
for (F, /) to be any finite G C Q[U] for which C(G) nQ[C/] = side{F, f) nQ[C/]. 

For FU{/} C Q[A„] and U a non-empty subset of A„, let (F : f°°)u be the 
ideal (F U {fz — l})yf„u{2} Q[U]. (If U = X„, this is the saturation of / with 

respect to the ideal (F)x„-) 

Results similar to the following have already appeared in the literature. 
Theorem 3. For the possible theorem (F, /), side{F, /) = Cx„u{z}{F U {fz — 

i})nQ[A„]. 

Proof. Let B{F, f) = Gx„u{2}(^U{/z— l})nQ[X„]. The following are equivalent. 

— gGB{F,f); 

— g G Q[A„], and if a S Vx„(F) and f{a)b = 1 for some b G L, then g{a) = 0; 

— g G Q[A„] and if a G Vx„(F) and /(a) yf 0, then g{a) = 0; 

— g G Q[^n] and if a G Vx„{F) and g{a) yf 0, then /(a) = 0; 

— g G side{F,f). 

Hence side{F, f) = B{F, f). □ 

Theorem 4. Suppose that L is algebraically closed. Then side{F, f) = G{{F : 
Dx^), and if a lexicographic order is used in which z is the biggest variable 
and the variables in U are all ordered below those in Xn\U, then GB(FU{fz — 
1}) nQ[17] is a complete set of generic side polynomials for (F,f). 
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Proof. 



Cu{{F : f)u) = Cu{{F U {fz - l})x„u{z} n Q[U]) 

Q Cx„u{z}i{F U {fz - l})x„u{.} n Q[C/]) n Q[U] 

C Cx„u{.}iiF U {fz - l})x„u{z}) n Q[U] 

= Cx„u{z}(FU{fz-l})nQ[U] 

= side{F, f) 

from Theorem 01 (Note that this part of the argument works even if L is not 
algebraically closed.) 

Conversely, if L is algebraically closed and g S side{F, f) nQ[C/], then by 
TheoremOl g G Cx„u{z} (^U{/z— l})nQ[C/], so by Hilbert’s Nulltellensatz, there 
exists n > 0 for which g" G (^’U{/z — l})x„u{z} and hence g” G Cjj{{F : f°°)u), 
so by Hilbert’s Nullstellensatz, g G CjjiiF : f)u)- Hence sideiF, f) fl Q[C/j C 
Cu{{F : f)u) and so side{F, f) = Cu{{F : f)u). 

From the theory of Grobner bases, if a lexicographic order is used in which 
the variables in U are all ordered below those in U {z}\C/, then GB{{F U 
{fz — 1}) n Q[U]) = GB{F U {fz — 1}) n Q[U], which is therefore a generating 
set for the ideal {F : f°°)u and hence is a complete set of side polynomials for 
{F, /), since if an ideal I is a complete set of side polynomials for (F, /) then so 
is any generating set for I. □ 

The following is immediate if one lets U = 

Corollary 1. Suppose L is algebraically closed. If a lexicographic order is used 
in which z is the biggest variable, then GB{F U {fz — 1}) nQ[Xn,] is a complete 
set of side polynomials for (F, /). 

This generalises a result in ^ , where it was shown that if a side polynomial 
h consistent with the hypotheses (in other words a non-extraneous side polyno- 
mial) exists, then there will be one in GF(FU{/z — l})nQ[X„]. Our result shows 
that GB{F\J {fz — 1}) nQ[X„] spans all possible side conditions in the natural 
sense described earlier, and moreover this extends to generic side polynomials. 

Suppose L is algebraically closed. Let G(F, /) = GB{F U {fz — 1}) 0 Q[X„], 
computed with respect to a fixed lexicographic order in which z is the biggest 
variable and the variables in U are all ordered below those in Xn\U. 

Let 

Gi = G(F,/)nQ[C/], 

G 2 = {5 I g e G{FJ),g^Q[U],G{F,g)nQ[U] = 0}, 

G 3 = {g\g€ G{F, f),g ^ Q[U],G{F,g) O Q[U] ^ 0, G(F, g) ^ {!}}, 

G 4 = {g I g G G(F, f\g ^ Q[C/], G(F, 5 ) = {!}}. 

We need the condition 5 ^ Q[t7], in the definitions of G 2 , . . . , G 4 in order that 
the sets Gi are disjoint. Gi consists of a complete set of generic side polynomials 
as we have just shown, G 2 consists of generically resolving side polynomials. 
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G3 consists of non-extraneous degenerate side polynomials and G4 consists of 
extraneous side polynomials. Thus the Gi partition G{F, f): G = and 

Gi n Gj = 0 for i ^ j. 

Theorem 5 . Suppose L is an algebraically closed field and U C is a set of 
independent variables. The possible theorem (F, f) is 

1 . universally true if and only if G\ = {1}; 

2. generically true if and only if G\ yf 0 ; 

3. generically conditionally true if and only if Gi = 0 , G2 

emptyset; 

4 - degenerately true if and only if Gi = G2 = 0, G3 yf 0; 

5. rarely true if and only if Gi = G2 = G3 = 0 , G4 yf 0 . 

Proof Since L is algebraically closed, by Corollary P G{F, f) = GB(F U {fz — 
1 }) nQ[X„] is a complete set of side polynomials for {F,f). Consequently, for 
any side polynomial h for (F,f), there exists k G G{F, f) such that h{a) yf 0 
implies k{a) yf 0 , a G L". Hence k is zero on no more irreducible components of 
Vx„ (F) than is h. 

If {F, f) is generically true, then by Theorem El Gi yf 0 ; the converse is 
obvious. 

Suppose {F, /) is generically conditionally true. Then Gi = 0 as otherwise 
there would be a generic side polynomial for (F, /). Furthermore, there exists a 
side polynomial h for (F, f) which is generically resolving. Hence by the above, 
there exists k G G{F, f) which vanishes on no more irreducible components of 
V(F’) than does h. Hence k is either generically resolving or generic. It cannot 
be generic since Gi = 0, so fc G G2 and so G2 yf 0. 

Conversely, suppose Gi = 0 , G2 yf 0 . Then {F, f) is not generically true 
since Gi is a complete set of generic side polynomials, but there is a generically 
resolving side polynomial, so {F, f) is generically conditionally true. 

Suppose {F, /) is degenerately true. Then as above, Gi = G2 = 0 . Also 
as above, because there is a degenerate side polynomial for (F,f), there must 
be one in G{F,f), and so G3 y^ 0 . Conversely, if Gi = G2 = 0 and G3 yf 
0 , then {F, f) is certainly not generically true since Gi = 0 , but neither is 
it generically conditionally true, as if it were, G2 yf 0 from earlier in the proof. 
Thus (F, f) is degenerately true since there are no generic or generically resolving 
side polynomials, but there is at least one degenerate side polynomial. 

Suppose (F, /) is rarely true. Then, again, Gi = G2 = G3 = 0 yet G4 yf 0 . 
Conversely, if Gi = G2 = G3 = 0 , G4 yf 0 , then, as above, (F, /) is not generically 
true, nor generically conditionally true, nor degenerately true, and hence is rarely 
true. □ 

Thus G(F, /) provides a complete set of side polynomials for (F, /) from 
which the kind of truth may be determined along with the relevant side polyno- 
mials and, with luck, a geometrical interpretation of each such side polynomial, 
leading to a geometrical side condition for each. Such computations are illus- 
trated in the following section. 
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5 Implementation and Examples 

The classification procedure given by Theorem 0 can be attached to a standard 
implementation of a refutational prover for algebraic geometry theorems. This 
has been coded in Mathematica using the following algorithm: 

1. Translate geometric predicates in the hypotheses and conclusion to alge- 
braic polynomials, giving F C Q[X] and / G Q[^], respectively. For exam- 
ple, Collinear [a,6,c] (meaning that the points a, 6, and c are collinear) 
translates to the coordinate polynomial 

(x[a] - x[b]){y[b] - y[c]) - {y[a\ - y[b]){x[b] - x[c]). 

Additionally, the construction sequence is used to determine which variables 
are independent, the elements of U Q X, as described in |2j. 

2. Compute the Grobner basis G{F, f) = GB{F U {fz — 1}) fl Q[A], removing 
the polynomials involving z. 

3. Split G{F, f) into the four sets Gi, G 2 , G 3 , G 4 , as defined in the preamble to 
Theorem This involves additional Grobner basis computations. 

4. If Gi = {1} then return True. Otherwise, attempt to translate the polynomi- 
als back into geometric predicates using pattern matching. (If no predicate 
can be determined then the polynomial is returned for inspection by the 
user.) 

The Mathematica code for our implementation can be downloaded from 
http : //www. maths .utas . edu.au/People/dfs/dfs .html 

We now give some simple examples to show the kinds of truth of geometry 
theorems and the results of this algorithm. 



Parallel Pappus 




The following is a famous theorem of Pappus: 

Hyps [Pappus] = { Collinear [A, B,C] , Collinear [D,E,F] , 

Parallel [A , E , B , F] , Parallel [B , D , C , E] } ; 
Cone [Pappus] = Parallel [A, D, C, F] ; 
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The function Prove [F , /] returns the kind of truth of the possible theo- 
rem (F, /) as a 4-tuple {generic, conditional, degenerate, extraneous} of sets of 
equations. In the case where the first of these equals {1} with all others empty, 
the output is rendered as True. 

Prove [Hyps [Pappus] , Cone [Pappus] ] 

True 

Thus this possible theorem is universally true. Any instance of the hypotheses 
is an instance of the conclusion, without restriction. Universal truth is an uncom- 
monly strong property of possible theorems. It means that the entailment holds 
for arbitrary choices of the points. For example, the Parallel Pappus theorem 
holds even when all the points are collinear. 



Collinearity Theorem 

Consider the following statement carefully: 

Hyps [CollinearityThm] = { Collinear [A, B,C] , Collinear [A, B ,D] }; 
Cone [CollinearityThm] = Collinear [B,C,D] ; 

Here the prover chooses U = {xA,yA,XB,yBiXc,XD} from the construction 
and computes 

G{F, /) = {-yA + VB, XA - xb}- 

Each polynomial is in Q[C/] and so Gi = G{F, /), G 2 = G 3 = G 4 = 0. Invoking 
Prove gives the partition of G(F, /) and interprets the polynomials: 

Prove [Hyps [CollinearityThm] ,Conc [CollinearityThm] ] 

{{! Identical [A, B]>, {}, {}, {» 

Thus we have a generic side polynomial for the possible theorem. The theorem 
is generically true, and the associated side condition asserts that points A and 
B are not identical. The prover has established that ! Identical [A, B] is a 
weakest possible (generic) side condition: any other side condition is at least as 
strong. (Readers should draw a diagram for the case where A and B coincide to 
see the problem.) 

Parallelogram Theorem 

The following possible theorem says that the diagonals of a parallelogram bisect 
each other. 

Hyps [ParallelogramThm] = { Parallel [A, B,D,C] , 

Parallel [D, A, C,B] , Collinear [0,B,D] , Collinear [0, A, C] }; 

Cone [ParallelogramThm] = EqualLength [A , 0 , 0 , C] ; 
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Fig. 2. The Parallelogram Theorem. 

Here the predicate EqualLength[a,6,c,d] says that the line segments ab 
and cd are of equal length. 

The prover chooses U = {xA,yA,XB,yBiXc,yc} from the construction and 
computes 



G(F,/) 



' -XByA + XCVA + XAys - xcys - XAyc + XbVC, ' 
xcyB - xoyB - XByc + xoyc + xbVo - xcyo, 
-xcyA + xoyA + XAyc - xoyc - XAyo + xcyo, 
XcyB - XoyB - XByc + xoyc + XByo - xcyo, 
-xcyA + xoyA + XAyc - xoyc - xavd + xcyo, 

, -Xoyc + xoyc + xcyo - xoyo - xcyo + xoyo , 



Here 



Gi = {-XByA + XcyA + XAyB - xcyB - xavc + XByc}, 

{ xcyB - xovB - XBVC + xoyc + XByo - xcyo, ) 

XcyB - XdVB - XBVC + Xoyc + XbVD - xcyo, \ 
-XcyA + XoyA + XAyc - xoyc - XAyo + xcyo, [ ’ 
-xdVc + Xoyc + Xcyo - xoyo - xcyo + xoyo ] 

Ga = 0, 

G4 = {-XcyA + xoVA + XAyc - Xoyc - XAyo + xcyo}- 



Again Prove gives this partition and interprets the side conditions: 



Prove [Hyps [ParallelogramThm] , Cone [ParallelogramThm] ] 



{{ ! Collinear [A,B,C] }, 

{ ! Collinear [A,C,D] , ! Collinear [B,C,D] , ! Collinear [B,C,0] , 

! Collinear [C,D, 0] } , 

o, 

{ ! Collinear [A , C , 0] }} 

This theorem is generically true: the associated side condition states that the 
points A, B, and C are not collinear, and again, any other generic side condition is 
at least as strong as this one. We also have five non-generic side conditions. One of 
these, the side condition Collinear [A, C, 0] is a consequence of the hypothe- 
ses and hence is extraneous. The remaining four side conditions are generically 
resolving. 
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Isosceles Theorem 

In this example, EqualAngle [a, 6, c, d, e, /] says that the angle Za5c is equal to 
the angle Zdef. 

Hyps [IsoscelesThm] = {EqualAngle [A, B,C,C, A, B] } ; 

Cone [IsoscelesThm] = EqualLength [A , C , B , C] ; 

We obtain 

Prove [Hyps [IsoscelesThm] , Cone [IsoscelesThm] ] 

{{}, {ICollinear [A, B, C]}, {}, {}} 

So the possible theorem is generically conditionally true. The conditional 
predicate ! Collinear [A, B, C] identifies which of the two generic components 
gives a theorem. 

A Rarely True Theorem 

Rarely true theorems are not of great interest, but here is an example. Letting 
Midpoint [a , 6 , c] be the predicate that the midpoint of the line segment between 
a and b is the point c, we obtain the following: 

Hyps[NonThm] = {Midpoint [A, B,C] }; 

Conc[NonThm] = Midpoint [A, C,B] ; 

Prove [{Midpoint [A,B ,C] } .Midpoint [A,C,B] ] 

{{}, {}, {}, {! Midpoint [A, B,C]» 

Thus any side conditions for the possible theorem are at least as strong as the 
negation of the hypothesis! That is, there is no component of the hypothesis on 
which the conclusion holds. The theorem fails to hold in a most comprehensive 
way. 

6 Conclusion 

Universal truth was considered in m, as was the Grobner basis characterisation 
given above. Universal and conditional truth were also considered by Wang |B|. 
The definitions of generic truth and non-degeneracy conditions originate with 
Wu Oj , and have been considered also by Chou in [2| , where a variant on the 

Grobner basis method using fields of rational functions is featured. Conditional 
truth in general (meaning neither universal truth nor rare truth) was considered 
in PI along with the Grobner basis method of proof. Generically conditional 
truth was considered in P) though no Grobner basis method was given. In 
two strengths of generic truth were defined in terms of the highest dimension 
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irreducible components of the hypothesis variety, an approach often giving a 
different notion of generic truth to the one used here and one which Chou argues 
in 13 is not always the one intended by the user. The notion of a complete set 
of side polynomials, though hinted at in 0, seems not to have been explicitly 
considered elsewhere. 

More recently, the article |3 takes a similar approach to ours, in that a possi- 
ble theorem (F, /) is classifiable as universally true (called “geometrically true” 
in 0), generically true, neither generically true nor generically false (generi- 
cally conditionally true in our terms), and generically false. However, this is 
done by computing with both (elimination ideals generated by) F U {fz — 1} 
and F U {/}, whereas our approach considers only Grobner bases of the former 
kind of set (that is, side polynomial calculations) . The approach in |Z) does not 
seem to be able to provide information in the generically conditionally true case 
(other than to flag the need for a decomposition), whereas our approach is able 
to provide side polynomials which eliminate the generic irreducible components 
on which the conclusion fails to hold. An advantage of the approach in [Z| is the 
possibility of generating additional hypotheses of equational type (rather than 
just inequations) in the generically false case, although the approach is not guar- 
anteed to do this. Nonetheless, it would be possible to use a combination of the 
approach in f] and our approach in such cases: first, that a theorem is generi- 
cally false could be established using our approach, and then F U {/} could be 
considered in an attempt to obtain sufficient additional equational hypotheses. 

The main contribution of the current work is to bring together facts which 
show that a single Grobner basis calculation for FU{/z — 1} yields a complete set 
of side polynomials {gi,g 2 , ■ ■ ■ , gk} for the possible theorem (F, /), and moreover 
that this (plus perhaps similar calculations of Grobner bases for some of the 
F U {giZ — 1}) is all that is needed to classify the kind of truth of the theorem 
and to provide the appropriate complete set of side conditions. 
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Abstract. We use complex vectors in geometrical reasoning, specifically 
automated theorem proving. The calculations are embedded in Clifford 
algebras, but commutative polynomial techniques can be used. Using 
the Grobner basis package in the computer algebra package Maple, this 
approach is shown to have efficiency benefits in situations where distance 
and angle relations amongst geometrical entities predominate. 



1 Introduction 

The connection between the complex numbers with conjugation, C, and the 
orthogonal transformations of is well-known, and leads to an algebraic rep- 
resentation of geometrical quantities involving plane vectors. The key to this is 
that the inner product of two plane vectors u,v G C is given by (uv + uv) /2 
where u is the usual complex conjugate of u. Thus commonly used geometri- 
cal relations may be expressed in terms of this complex number formalism, and 
the algebraic forms are often more natural and succinct than the correspond- 
ing forms involving coordinates. We shall exploit this natural advantage for the 
purpose of automated reasoning about geometrical configurations. 

We give below a comparison of three formalisms in expressing a number 
of basic geometrical relations and quantities. A point A will have coordinate 
representation (01,02) G and vector representation a = a\ + 021 G C. One 
expression can be obtained from the other using the relations 

O O Oi -I- 02 ^, d O Oi — 02b 

aifd>x(a + a), 02^— (o-o), 

2 Jz 

and dividing through by i if necessary (although in practice, this last step is 
scarcely ever needed). This is not simply a linear change of variables since the 
imaginary unit lies outside the scalar domain. 

We also consider a Clifford algebra representation A = aiX + 02T, where 
X,Y are orthogonal unit vectors {X^ = Y'^ = 1 and XY = —YX) in the rank 
two real Clifford algebra. 



J. Richter-Gebert and D. Wang (Eds.): ADG 2000, LNAI 2061, pp. 143-|1^^ 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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The complex vector representation can also be thought of as derived from 
the Clifford algebra representation; indeed the complex numbers are naturally 
embedded in the rank two real Clifford algebra, by virtue of the fact that the sub- 
algebra of bivectors (which contains all reals as squares of vectors) is isomorphic 
to the complex numbers (see j2] for the details of the gemoetrical significance of 
this in terms of angles). Thus an arbitrary vector X of length 1 (X^ = 1 in the 
algebra) is chosen, and for any other vector A, the product XA is represented by 
a G C while AX is represented by a £ C. Since X^ = 1, all expressions can be 
rewritten to allow this transformation: for example, AB is rewritten as AXXB 
which becomes ab in the complex representation. 



2 Geometrical Statements in Various Formalisms 

The following list gives examples of geometrical quantities and relations ex- 
pressed in the three formalisms. The standard coordinate form is given first in 
each case, followed by the Clifford form and then the complex vector form. Of 
interest is the number of terms in each polynomial. 

1. The length of segment AB. 

(a) (oi — bi)^ + (tt2 — ^2)^, yielding an expression consisting of six terms on 
expansion. 

(b) {A — B)^, yielding four terms on expansion. 

(c) (a — b){a — b), yielding four terms on expansion. 

2. The tangents of the angles ABC and DEF are equal. 

(a) [(a2-62)(ci-6i)-(c2-62)(ai-6i)][(di-ei)(/i-ei)-k(d2-e2)(/2-e2)] = 
[{d2 — e2){fi — ei) — {f2 — e2){di — ei)][{a2 — b2){c2 — b2) + {ai — bi){ci — bi)], 
which, after expansion and simplifications, yields an equation consisting 
of 96 terms. 

(b) (B -C){A- B){D - E){E -F)-{A- B){B - C){E - F){D - E), an 
expression in 30 terms. 

(c) (a — b){b — c){d — e)(e — /) — (a — b){b — c)(J — e)(e — /) = 0, which 
ultimately yields an expression in 30 terms. 

3. The product of two squares of segment lengths, AB^ ■ CD'^. 

(a) [(oi — biY + (o2 — 62)^] [(ci — diY + (c2 — ^2)^], which expands to 36 
terms. 

(b) {A — BY{C — DY, an expression in 16 terms. 

(c) (a — b){a — b){c — d){c — d), yielding 16 terms. 

4. Line AC is parallel to line BD. 

(a) (c2 — a 2 ){di — bi) — (c?2 — &2)(ci — «i) = 0, which has 8 terms upon 
expansion. 

(b) (C -A){D-B)~ {D -_B){C - A), 8 terms. 

(c) (a — c){b — d) — {a — c){b — d) = 0, 8 terms. 

5. Point A lies on line BC. 

Use the previous case with AB parallel to BC, giving expressions with 6 

terms for each formulation. 
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6. Line AC is perpendicular to line BD. 

(a) (c2 — a2){d2 — 02) + (ci — ai)((ii — ai) = 0, yielding an expression with 
8 terms. 

(b) (C -A){D-B) + {D -_B){C - A), 8 terms. 

(c) (a — c){b — d) + {a — c){b — d) = 0, again an 8 term expression. 

7 . A is the midpoint of B and C. 

(a) oi - {hi - Ci )/2 = 0 , 02 - (62 - C2)/2 = 0 . 

(b) A = (B+C)/ 2 . 

(c) o — (6 + c)/2 = 0, hence also o — (6 + c)/2 = 0. 

It is evident that the advantages of the Clifford algebra vector formalism over 
the usual coordinate approach in terms of succinctness are also possessed by 
the complex vector formalism, even though this is a commutative formalism. 
However, the use of conjugation means that essentially twice as many variables 
are needed when using complex vectors as compared to Clifford vectors, the same 
as for the coordinatised versions. 

In the following sections we compare the standard coordinate formalism with 
the complex vector formalism. In particular, in Sections Eland El we describe im- 
plementations and give timing results of proofs using the two representations. 
This is appropriate since, having expressed a theorem in the respective formal- 
ism, both approaches use the same algorithm to carry out the proof. This is not 
the case for the Clifford representation; we leave discussion of this approach to 
later work. 

Throughout, we shall be interested in the solution of various problems using 
the Grobner basis approach [p. For geometrical theorem proving, an alternative 
method is Wu’s HMBI . Wu’s method was the first approach to geometrical 
theorem proving using polynomial algorithms, and is essentially equivalent to 
the Grobner basis approach for this class of problems as is demonstrated in | 2 | . 
We anticipate the same sorts of efficiency improvements using Wu’s prover as 
occur using the Grobner basis approach. However, the Grobner basis approach 
is more readily generalised to non-commutative algebras, and packages based on 
Buchberger’s Grobner basis algorithm are in wider use than are implementations 
of Wu’s algorithm. We use Maple’s built-in Grobner package in the computed 
examples. 

3 Viewing the Complex Vector Formalism 
as a Change of Coordinates 

Geometry theorems may be expressed in terms of polynomial algebra, using 
coordinatisation. Once a field of scalars K is fixed, the hypotheses and conclusion 
of the theorem correspond to certain polynomials in the coordinates of points 
being zero. In standard methods, the field K is assumed to be algebraically 
closed in order to ensure completeness, since then Hilbert’s Nullstellensatz can 
be used. Typically one takes K to be C, the complex number field, which has 
the effect of replacing the real Euclidean plane by C^. 
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The ramifications of working with rather than are explored fully in j2j, 
but essentially the difference is that one proves theorems of metric geometry, a 
generalisation of Euclidean geometry with the property that a theorem is true in 
metric geometry if and only if the corresponding coordinatised algebraic theorem 
is true over any algebraically closed field of characteristic zero. As discussed in 
0, in practice little is lost in working with C rather than R . 

In the present context, working with C rather than R has special signifi- 
cance. Letting i be the imaginary unit in C, the invertible linear transformation 
0 : — >■ given by (01,02) 1— >■ (a, a), where o = oi -I- 02^ and o = oi — 021, 

induces an invertible linear change of variables ip from the polynomial ring 
C[xi,yi,X2,y2, ■ ■ .,Xn,yn] tO C[vi,Vi,V2,V2, ■ ■ ■,Vn,Vn] lu which Vj = Xj + iyj 
and Vj = Xj — iyj for each j. (Note that for any (01,02) £ C^, the complex 
conjugate of Oi -I- m2 is not oi — m2 unless Oi, 02 £ R. However, we retain the 
over-bar notation at the risk of some confusion.) 

Thus, writing /j = ip{fj) and g' = ip{g), if a theorem’s hypotheses are 
captured by the equations /i = 0, /2 = 0 , . . . , /fe = 0 and the conclusion by g = 0, 
then in the transformed coordinates, these become f[ = 0, = 0 , . . . , /(, = 

0 and g' = 0. The second set of equations are all expressed in terms of the 
complex vector formalism, so the more efficient algebraic forms given in the first 
section will feature. Hence the conversion between formalisms can be performed 
internally, with no need to introduce an imaginary unit i: it is already part of 
the field. The significance of this is that the complex vector formulation can 
be viewed simply as an alternative coordinate-based formulation. In a theorem 
proving context, this will mean that all the usual cooordinate-based methods 
apply in full. 



4 Generic Truth and Grobner Bases 

The standard notion of generic truth, discussed at length by Chou in 0 and 
in many places since, turns out to be the appropriate one for most geometry 
theorems encountered in practice. (Most “theorems” turn out to be false due to 
degenerate cases not easily excluded by the user.) This approach requires that 
certain of the variables in a coordinatised possible theorem be specified as pa- 
rameters. The idea is that these coordinates can be chosen in a mechanical way 
on the basis of the ordering of the hypotheses. This ordering contains implicit in- 
formation about a notional “construction” associated with a theorem statement: 
the first-mentioned points are chosen “arbitrarily”, or in general position, and 
then subsequent points are either partially or wholly constrained by the theorem 
hypotheses. In this formulation, theorems are proved to be “generically true” in 
a sense made rigorous in USE]. The idea is that one is only concerned with 
cases of the hypotheses in which the parameters are algebraically independent, 
and Chou gives strong arguments as to why this is a logically and geometrically 
sound approach. Algorithmically, using rational expression fields is shown in P| 
to be a very efficient way of checking generic truth, and also allows one to obtain 
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non-degeneracy conditions, sufficient for the truth of the theorem and expressible 
as algebraic inequations. 

Issues of genericity are readily dealt with in our scheme: if a point is in general 
position, both it and its “conjugate” are viewed as parameters and added to the 
coefficient field for purposes of computations. If the point is viewed as totally 
dependent, both it and its “conjugate” are added to the variable set. If a point 
represented by x in the complex vector formalism has a single notional degree of 
freedom (for instance, if it is constrained to lie on a particular line or circle), then 
X is made a parameter but not x. (Note that this would make no sense unless we 
viewed both x and x as separate and independent complex scalars rather than as 
a vector in the complex plane and its conjugate.) All other variables are left as 
polynomial variables. Once this is done, the methods applying to the standard 
formulation also apply here. 

The results which are the basis of provers of generic truth are well known 
and we state them here. Let 

F = {/l,/ 2 , • ■ • ,/fe} ^ K{m)[n] = K{ui,U 2 , ■ ■ .,Um)[xi,X 2 , ■ ■ .,Xn], 

and let GB{F) be the reduced Grobner basis of F in K(m)[n] with respect to a 
fixed admissible term ordering on the monomials in the Xi. With the definition 
of generic truth given in |1| , we have the following extension of an approach first 
introduced by Kapur (see I9I1UI where this refutational approach is discussed 
in detail), and extended to the generic context via the use of fields of rational 
expressions in |^. 

Theorem 1. 1. If g G GB(F), then the possible theorem represented by the 

algebraic implication /i = /2 = • • • = /fc = 0 g = 0 is generically true. 

2. The possible theorem /i = /2 = • • ■ = /fc = 0 g = 0 is generically true if 
and only if GB{F U {gXn+i — 1}) = {1}- 

In our computed examples, only the first part of Theorem Q is needed in 
order to verify the generic truth of theorems, reflecting the experience of Chou 

in 0- 

Thus our complex vector approach is really a coordinate approach in dis- 
guise. This contrasts with genuinely vector approaches, such as those of Chou, 
Gao and Zhang in which theorems of two and three dimensional geom- 

etry are proved using a radically different formalism based on areas and vol- 
umes, allowing higher level interpretation of the resultant proofs. Similarly, the 
point-based algebraic approach to automated reasoning in projective geometry 
featured in the work of White PI, Sturmfels and Whitely [El uses a funda- 
mentally different algebraic algorithm, Cayley factorization in Cayley algebras, 
to do reasoning in projective geometry. In |B|, Fearnley-Sander and Stokes use 
a variant of Buchberger’s algorithm which applies to Grassmann algebras to 
do automated reasoning in affine geometry. Wang demonstrated that Clif- 
ford algebras with genuine algebraic term rewriting could be used for a range of 
geometrical reasoning. A summary of recent advances in the use of Clifford rep- 
resentations is given in IP- The complex translation related to Clifford algebra. 
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mentioned in the Introduction, also contrasts with the rule-based methods 
that have been used for reasoning with the Clifford formalism. 

5 Implementation 

A feature of Chou’s book |2j is the very large number of theorems proved me- 
chanically using the methods described in the first half of the book. We used 
the implementation of the Buchberger algorithm built into the computer algebra 
package Maple V, running on a SPARC Station 1 (Sun 4/60), to test the advan- 
tages of our approach for a number of the theorems considered in . The choice 
was not random: a theorem was chosen if it featured many distance or angle re- 
lations, a situation we might expect to be better handled by the complex vector 
formulation. However, we have included all results obtained, and we believe that 
the process of identifying theorems that are likely to be more efficiently proved 
by our technique could be fully automated. 

We made no attempt to obtain non-degeneracy conditions, only to prove 
generic truth. The Maple package does not permit the recording of polynomials 
which arise as denominators during a computation. However, a package such as 
the one referred to in P] , which does this and is able to interpret non-degeneracy 
conditions geometrically, could easily be modified to do these things for polyno- 
mials in the complex vector formalism. The sole object here has been to show 
that our change of variables leads to faster processing for a wide class of easily 
identified theorems. 

We remark by the way that some other types of theorems from j2| were 
tested, in particular some purely affine theorems. Here, the results if anything 
favoured the usual coordinate-based approach, but not markedly. No advantage 
was anticipated from the use of the transformed system for such problems. 

Given a finite set of geometrical hypotheses Hi, H 2 , . . . , and a single con- 
clusion C in the points Pi, P 2 , . . . , Pn, the procedure was as follows: 

1. Convert each Hi to a polynomial fi{xi,yi,X 2 , p 2 , ■ ■ ■ ,Xn, Vn) using the con- 
version rules of the first section; similarly convert C to g. 

2. Select independent points Ai, A 2 , . . . , Ar, semidependent points 
Ar+ 1 , Ar +2 ,■■■, As and dependent points As+i,As+ 2 ,...,A„. 

3. Compute G = GB{fi,f 2 , ■ ■ ■ ,fk) using the total degree order with yr+i < 
yr+2 <■■■ <ys < ys+l < 2;s+l < Vs+2 < Xs+2 < ■ ■ ■ < Vn < Xn- 

4. Reduce g using G. 

Only the third and fourth stages take a significant amount of processing time, 
and the times given in Table G] are for the total of those parts only. 

The procedure for the complex vector formulation was the same, the only 
difference being the rules for converting geometrical predicates into polynomials. 

For the standard coordinate approach, the choice of which variables were 
parameters was specified in [3, having been obtained automatically by Chou’s 
prover, and we used the same choice. Determining parameters for the corre- 
sponding complex vector formulation of the theorem was done according to the 
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choice used for the standard formulation. For testing purposes, we used a total 
degree order for both formalisms, whereas 0 used a lexicographic order, so the 
choice of order was less than optimal for the coordinatised system, and likewise 
for the corresponding complex vector formulation. Also, the variable ordering 
choice in |3 was predicated on subsequent use by Wu’s method, and a slightly 
different variable ordering may have been better for maximising the efficiency of 
the Grobner basis method. 

There were some refinements on the above algorithm that should be men- 
tioned. Some efficiency gains were achieved using the standard variables by 
choosing axes conveniently, as was done in |5| as a matter of course. For in- 
stance, the first point could be chosen as the origin and the second as on the 
A-axis, of the form (a;, 0); thus three of the parameters were set to zero, without 
affecting the generic truth of each possible theorem. Mostly these gains could 
be carried over to the complex vector representation: for instance, a vector x 
on the A-axis satisfies x = x. However, using standard coordinates also permits 
the assertion of collinearity and perpendicularity conditions by a careful choice 
of axes, a technique not possible in the complex vector formalism. For example, 
the line from (xi,X 2 ) to (a;i,a: 3 ) is parallel to the line (0,0) to (0,ui), a fact 
which must be expressed equationally using the complex vector variables. 

Because of the reducibility of the coordinate-based algebraic formulation of a 
theorem, ambiguities in the geometrical interpretation occasionally arise. These 
are dealt with in j2] by introducing certain subsidiary equations which rule out 
the unwanted possibilities. Where this arose in our examples, we included a com- 
plex vector version of each such subsidiary polynomial. The resultant complex 
vector polynomials tend to be slightly less simple than the original ones, but the 
advantage of the method for such examples was still clear. 

Were it not for these various refinements, it would be possible to convert 
directly between the standard and complex vector formulations using the sub- 
stitution scheme 

Xj ^ {uj + Vj)/2, yj ^ {uj - Vj)/2i 
for converting from standard to complex vector variables, and 

Uj I— >■ Xj + iyj, Vj I— >■ Xj — iyj 

for the inverse process. 

6 Brianchon’s Theorem for Circles 

We will illustrate the potential computational benefit from the complex vector 
formulation using a collection of examples from 0. We will show the details of 
the application to Brianchon’s theorem for circles (Example 19 in | 2 |), with the 
remaining timing results summarised in Section 0 

Brianchon’s theorem is illustrated in Figure 0 The theorem states that if A, 
B, C, D, E, F are six points on a circle then the three lines through the opposite 
verticies of the hexagon formed by the tangent lines to the circle at these points 
are concurrent. 
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Fig. 1. Brianchon’s Theorem. 



We will start by showing the standard conversion into polynomials using the 
coordinate approach. We construct the hypothesis polynomials by starting with 
point O, the centre of the circle, at the origin and point A on the horizontal 
axis. The remaining points, including the vertices of the hexagon, Ai, Bi, 

Di, El, and Fi, are constructed (in order) as follows. 

Firstly, we fix the points on the circle by setting BO = OA, CO = OA, 
DO = OA, EO = OA, and FO = OA. This large number of equal- length 
relationships will be naturally suited to the complex vector method since each 
once can be captured in fewer terms, as seen in Section 0 Secondly, we construct 
the hexagon’s vertices using the tangent relationships AiB T BO, AiA T AO, 
BiC T CO, BiB T BO, CiD T DO, CiC T CO, DiE T EO, DiD T DO, 
EiF T FO, EiE T EO, and FiF T FO, and (for simplicity) saying Fi is on 
line A\A. Finally we describe the point of intersection, J, by saying J is on line 
BiEi and on line AiDi. The conclusion is then that points Ci,Fi and J are 
collinear. 

This construction gives the following coordinate representation of the points: 
O = (0,0), A = (ui, 0 ), B = {xi,U2), C = {X2,us), D = (0:3, U4), E = {xa,u^), 
F = {x^,uq), Ai = (ui,X6), Bi = {xs,xr), Ci = (a;io,a:9), Di = {xi2,xn), 

El = (xi 4 , Xia), Fi = (ui, ^15), I = {xi7, Xio). 

The list of hypothesis polynomials (each notionally set equal to zero) is then 



F = 



x\ + U2 — u\, X2 + — u\, x'^ + u\ — u\, x\ + u\ — u\, x\ + u\ — u\, 

XiUi — xf + XqU 2 — U2, XsX2 — X2 + X7U3 — U§, 

XsXi — xf + X7U2 — u^,Xii)X3 — -I- XqU^ — u\, 

X1QX2 -2:2-1- XgUz — 2:122:4 — 2:4-1- 2:11^5 — 

2:142:5 - x% + X13UQ - ul, 2:142:4 -2:4-1- 2:13215 - ul, 

X3U1 - xl + xi5Ue - Uq, X12X3 - xl + X11U3 - ul, 

-2:16X14 - X7X17 -I- X7X14 -I- X16X8 -I- X13X17 - X13X8, 

_ -X16X12 - X6X17 -I- X6X12 -I- X13U1 + X11X17 - XiiUi 
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with conclusion polynomial 



g = xgui - xgxir + xi^xn - xi^xw + a;i6a:io - xiqui. 



We find the Grobner basis using the total degree term order with Xi < Xg < 
• • • < X17. Generating the Grobner basis of the hypotheses and reducing the 
conclusion to zero took in excess of 5000 seconds. 

The complex vector formulation proceeds similarly but the refinements made 
in the coordinate case will not necessarily work in the new setting. For example, 
we can still choose O — (0, 0) but to specify that A is on the horizontal axis we 
use A = (rti, mi), saying that the point and its conjugate are the same. Similarly, 
in the coordinate case we could set the point Ai and Fi by noting that both 
would have the same X coordinate as A. This is not possible in the complex 
case and so we must in troduce new dependent variables x'^ and x{4, setting 
Ai = (x'^,xe) and Fi = {x'^/^^^xi^), and also the explicit perpendicularity and 
collinearity constraints that define where they are. 

Thus there are now 19 hypothesis polynomials 



Fc 



' X1U2 — u\, X2U3 — ul,X3U4 — ul,X4U5 — uf, X3U& — u\, 

\x3U2 + \x^Xi — X1U2, ^Uix'3 + ^UiXq — uf, ^XsU3 + ^XyX 2 ~ X2U3, 
^xsU2 + \xrxi - X1U2, \x13U4 + \xgX3 - X3U4, 

5X10U3 + ^XgX2 - X2U3, ^Xi2U5 + lxnX4 - X4U5, 

< Ixi4Uq + ^Xi3X5 - X^Ue, ^Xi4U5 + ^Xi3X4 - X4U5, , 

5X14^6 + 5X15X5 - X5U6, 5X12U4 + 5a;iiX3 - X3U4, 

^16^14 ^17^13 3^7^17 ^8^16? 

X16X12 - X17X11 + xexi7 - X6X12 - X5X16 + X5X11, 

_ -X15M1 + X14W1 - X14X6 + MiXe + X15X5 - M1X5} 



compared to 17 polynomials in F above. The conclusion polynomial is 



g — — X9X17 + X9X14 — X10X15 + X10X16 + X15X17 — X14X16. 



We use the same total degree term ordering as before but with x'5 inserted 
between X5 and xe, and X14 inserted between X14 and X15, to maintain the order 
of construction. The new proof took only 34 seconds. 



7 Further Timing Comparisons 

To conclude we give a timing comparison for a selection of other theorems chosen 
from 0, identified according to their numbering there. Tabled gives compar- 
isons between proofs using the standard formulation and the complex vector 
formulation. The superiority of the vector formulation is fairly consistent, and 
often quite considerable. 
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Table 1. Comparison between proofs nsing standard representation and complex vec- 
tor representation (time in seconds). 



Example 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


from | 2 ] 


19 


21 


39 


40 


43 


44 


45 


48 


Standard 


> 5000 


394 


81 


247 


20 


22 


32 


> 5000 


Complex 


34 


82 


47 


149 


18 


21 


34 


33 



Example 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


from | 2 ] 


63 


73 


80 


94 


96 


106 


109 


115 


Standard 


47 


97 


> 5000 


139 


3791 


2994 


> 5000 


432 


Complex 


170 


25 


897 


28 


8 


28 


33 


29 



Example 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


from [ 2 ] 


128 


144 


162 


240 


243 


271 


277 


302 


Standard 


1600 


3297 


56 


85 


171 


33 


1755 


490 


Complex 


620 


336 


120 


23 


90 


23 


43 


31 



Example 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


from | 2 ] 


303 


309 


310 


311 


315 


317 


347 


379 
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468 


510 


2302 


36 


340 


25 


> 5000 


88 


Complex 


25 


390 


87 


31 


78 


2723 


2190 


15 



Example 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


Chou 


from | 2 ] 


390 


393 


395 


401 


409 


440 


456 


487 


Standard 


> 5000 


1115 


71 


> 5000 


423 


1545 


10 


41 


Complex 


10 


336 


19 


37 


23 


53 


10 


39 
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Abstract. This paper presents methods for zero and ideal decompo- 
sition of partial differential polynomial systems and the application of 
these methods and their implementations to deal with problems from 
the local theory of surfaces. We show how to prove known geometric 
theorems and to derive unknown relations automatically. In particular, 
an algebraic relation between the first and the second fundamental coef- 
hcients in a very compact form has been derived, which is more general 
and has smaller degree than a relation discovered previously by Z. Li. 
Moreover, we provide symmetric expressions for Li’s relation and clar- 
ify his statement. Some examples of theorem proving and computational 
difficulties encountered in our experiments are also discussed. 



1 Introduction 

Automated reasoning in differential geometry was initiated by W.-t. Wu |2S] 
in the later 1970s as part of his work on mechanical theorem proving and dis- 
covering. Unlike the case of elementary geometry in which Wu’s method has 
proven extremely efficient for proving many theorems, the situation in the dif- 
ferential case is quite different. Wu [7Tl2i1j and his followers (for example, Chou 
and Gao |H| and the second author m) have applied the method to prove the- 
orems and derive relations mainly in the local theory of curves and mechanics, 
where the involved differential algebraic equations are ordinary (i.e., with only 
one derivation variable). The computational complexity becomes much higher 
in the partial differential case, where integrability conditions have to be taken 
into account and term orderings among derivatives need be introduced. In the 
case of two derivation variables, one may apply the method to deal with surfaces 
which has been investigated by Li nn and recently by the second author in 
Pg, a preliminary version of this paper. Along different directions, Carra Ferro 
and Gallo 0 tried to generalize the Grobner basis method |0I for automated 
theorem proving in differential geometry. Li and Cheng f 1 511 tij have proposed to 
combine Clifford algebra formalism with Wu’s method in order to produce short 
and readable proofs. 

The algebraic bases of Wu’s method are general algorithms for triangulariz- 
ing systems of differential polynomials and for decomposing such systems into 
finitely many characteristic systems. These algorithms are developed from the 
classical work of M. Janet, J. M. Thomas, E. R. Kolchin, and Ritt m, and are 



J. Richter-Gebert and D. Wang (Eds.): ADG 2000, LNAI 2061, pp. 154-|17^ 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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applicable to many other problems beyond geometry. The design and implemen- 
tation of efficient algorithms for decomposing differential polynomial systems 
have become the topics of several other researchers. For example, we may men- 
tion the algorithm Rosenfeld-Groebner developed by Boulier and others |,'II4| . 
the subresultant-based algorithm suggested by Li and Wang m for computing 
differential simple systems, the factorization- free algorithm proposed recently by 
Hubert HOI, and other relevant work by these authors. Some of these algorithms 
will be reviewed in Sect. 3 of this paper. In Sect. 4, we shall present an algo- 
rithm for decomposing any regular differential triangular system into simple sets 
such that the intersection of their saturated differential ideals is the same as the 
saturated differential ideal of the triangular system. This algorithm makes use 
of some ideas from the work of Kalkbrener H2| with improvement by the first 
author and is similar to the algorithms given by Hubert m, Boulier and 
Lemaire [S| , but they are different in the fact that our algorithm avoids comput- 
ing explicitly the inverse of some polynomials in an extension field and works in 
the general frame of positive dimension (there is no need to reduce the problem 
to one of dimension 0). 

The main objective of this paper is to apply Wu’s and other related methods 
based on zero and ideal decomposition and their implementations to deal with 
partial differential polynomial systems formulated from geometric problems, with 
a particular intention to study the local properties of surfaces. In fact, we have 
proved several well-known geometric theorems and derived some unknown rela- 
tions about surfaces automatically (see the examples in Sects. 5.1 and 6). An 
algebraic relation between the first and the second fundamental coefficients in a 
very compact form has been derived (Sect. 5.2), which is more general and has 
smaller degree than a relation discovered previously by Li Ha. Moreover, we pro- 
vide symmetric expressions for Li’s relation and clarify his statement (Sect. 5.3). 
Some basics from the local theory of surfaces are collected in the following sec- 
tion and the paper concludes with a mention of a few computational difficulties 
encountered in our experiments. 

The purpose of our work about surfaces is twofold: on the one hand, we take 
differential polynomial systems formulated from geometric problems as practical, 
not artificially constructed, examples to test the applicability and efficiency of 
decomposition algorithms, and on the other hand, we illustrate how differential 
geometric problems can be solved in an automatic way by using such algorithms. 
The results reported in this paper are still primitive and may serve to motivate 
further research on this subject. To achieve a full and effective automation of 
theorem proving and discovering in the theory of surfaces, considerable research 
and effort on both decomposition algorithms and algebraic formulation of geo- 
metric problems are needed. 

2 The Local Theory of Surfaces 

We recall some basic concepts from the local theory of surfaces. Let 



r = r{u, v) = {x{u, v), y{u, v), z{u, v)) 
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be a parametric surface © in three-dimensional Euclidean space. Then 



ru 



dr 

— and r„ 
ou 



dr 

dv 



are the tangent vectors along the lines of u and v in the tangent plane at point 
P (x, y, z) of 6. The first fundamental form of © is 



where 



|drp = Edv? + 2Fdudv + Gdv^, 



E = r 



2 

u 



E = r,. 



G = rt 




dx dx 
“ ^ dulh 




dy^ 
du^ 
dy dy 
du dv 

dr " 

dv 



dzy 
&u) ’ 
dz dz 
du dv ’ 

dv ) 



are called the coefficients of the first fundamental form {first fundamental coef- 
ficients for short) of ©. Except for singular points on ©, we have 

E > 0, G > 0, 5 = EG - E'^ >Q. 

t n 




Fig. 1 

Assume that the tangent vectors r„ and are not parallel at a regular point 
P. Then 

r„ X ry 
” = I 1 

\ru X rf 

is a unit normal vector perpendicular to the tangent plane of © at P. The 
quadratic form 

—dn ■ dr = Ldr 2Mdudv Ndv"^ 
is called the second fundamental form of ©, where 

L = -ru -riu, M = -ru • N = -ru ■ 
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are the second fundamental coefficients of ©. The function 

K= {LN -M^)/ 5 

is defined to be the Gaussian curvature of ©. We recall the following famous 
theorem of Gauss. 



Theorem 1 (Theorema egregium). The Gaussian curvature K of any sur- 
face depends only on the first fundamental coefficients E^F,G and their first and 
second derivatives. More precisely, 



/ 


F F Fy-\Gy 




F F \Fy 


\ 




F G \Gy 


- 


F G \ Gy 




V 


1 e; p _ 1 c; (y 

2 -^u u 2 




\Fy\Gy 0 





where ex — . 2 -Gvv Fy^y ^ Gyy. 

When expanded, the right-hand side of the above equality is a rational ex- 
pression, whose numerator consists of 17 terms, in F, F,G and their derivatives. 
We shall demonstrate in Sect. ISlhow this expression can be derived automati- 
cally. 

The three vectors [ry,ry,n] form a moving frame of © at P. Taking their 
partial derivatives with respect to u and v, we have 



{ ruu = rf^ry -b rfj^Ty -b Ln, 
ryy = Ff^Vy + Ff 2 rv -b Mn, 
ryy = Ff^Vy + Ff^Vy + N TV, 

Uy = [{MF - LG)ry + {LF - MF)ry\/6, 

Uy = [{NF - MG)vy + {MF - NF)ry]/S, 

where 

Ff, = {GFy - 2FFy + FFy)/{25), Ff^ = {2FFy - FFy - FFy)/{25), 

rl2 = {GFy - FGy)/{25), rf2 = {FGy - FFy)/{25), 

rf2 = {2GFy - GGy + FGy)/{25), rf2 = {FGy - 2FFy - FGy)/{25) 



( 1 ) 

(2) 



are the Ghristoffel symbols of the second kind. The equations and 0 are 
called Gauss formulas and Weingarten formulas, respectively. Their integrability 
conditions are given respectively by the following equations: 

r KF = {rk)u - {F,\)y + 

I KF = {Ff,)y - {Ff2)u + rhrf2 + Ff.rl - - {rf^f, 

I KG = {rf2)u - {rl2)v + - {r^f, 

I — i^l2)v ~ (^22)« + ^12^12 ~ ^22^11’ 



Ly - My = L rf2 + M{Ff2 - Fh) - N Ff^, 
My -Ny = L rf2 + M{rf2 - Ff^) - N rf2- 



(4) 
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As usual, Q is called the Gauss equations and 0 the Codazzi-Mainardi equa- 
tions . 

The concepts and results about the local theory of surfaces reviewed above 
are classical and can be found in standard textbooks of differential geometry, for 
example imu . 

3 Decomposition of Differential Polynomial Systems 

In order to reason about problems of surfaces, we shall formulate geometric 
conditions as partial differential polynomial {d-pol for short) equations and in- 
equations with two derivation variables u and v. Proving a known theorem may 
be reduced to determining whether the conclusion-d-pols vanish on the set of 
differential zeros of the hypothesis-d-pol system, and if not, on which part of 
the zero set they do. This is closely related to the problem of radical differential 
ideal membership. Discovering a new theorem amounts to deriving an unknown 
relation from the given geometric hypotheses expressed as d-pol equations. The 
determination and derivation may be made easier when the system of hypothesis- 
d-pols is decomposed into special subsystems that have certain triangular form 
and for which the radical differential ideal membership may be tested by simple 
reductions. In this section, we give a short review of major techniques for the 
decomposition of d-pol systems, which may be used for differential geometric 
reasoning. 



3.1 Differential Polynomials and Triangular Systems 

Let /C be a differential field of characteristic 0 with m derivation operators (5i, 

. . . , Sm- We call 0 — a derivative operator and ii im the 

order of 9. Let xi, . . . ,x„ be n differential indeterminates over /C. For any fc, 
the symbol 9xk denotes the derivative of Xk with respect to (wrt) 0; 9xk is said 
to be proper if the order of 9 is positive. We denote by 72. = /C{xi, . . . , x„} the 
ring of polynomials in the derivatives of Xi, ... ,Xn with coefficients in JC, by [P] 
or [P] the differential ideal generated by the d-pol P or the d-pols in P, and by 
(P) or (P) the algebraic ideal generated by P or the elements of P considered as 
ordinary polynomials in the derivatives. 

There are different admissible orderings by which derivatives can be ordered. 
We shall consider d-pols with a fixed admissible ordering -< for their derivatives. 
Let P be any d-pol in 72\/C. The highest derivative appearing in P is called the 
lead of P and denoted by ld(P). When ld(P) = 9xk, k is called the class of P. 
We call the leading coefficient of P wrt ld(P) the initial of P, and the formal 
partial derivative of P wrt ld(P) the separant of P; they are denoted by ini(P) 
and sep(P), respectively. 

A d-pol Q is said to be partially reduced wrt P if no proper derivative of 
ld(P) appears in Q, and reduced wrt P if Q is partially reduced wrt P and the 
degree of Q is lower than that of P in ld(P). In any case, one can compute a 
partial d-pseudo-remainder Ft! and a d-pseudo-remainder P of Q wrt P; then 
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there exist integers a and (3 such that 

sep(P)“Q - i?' e [P], sep(P)“ini(P)^Q -Rg [P], 

R' is partially reduced wrt P, and R is reduced wrt P. The d-pseudo-remainder 
of Q wrt P is denoted by d-prem(Q, P). 

Let P be any finite set of d-pols in TZ. A differential zero (d-zero) of P is 
an n-tuple (zi , . . . , Zn) in a universal differential extension field of /C such that 
every P G P becomes zero after 9xk is replaced by 9zk, for all 9xk occurring 
in the d-pols of P. We denote by d-Zero(P) the set of all d-zeros of P. The 
set of ordinary zeros of P, regarded as a set of polynomials with the occurring 
derivatives as indeterminates, is denoted by Zero(P). For any Q C P, we define 

d-Zero(P/Q) ;= d-Zero(P) \ d-Zero ({n„«o}). 

and similarly for Zero(P/Q). 

A nonempty ordered set T = {Ti, . . . ,Tr) of d-pols in P \ /C is called a 
differential triangular (d-tri) set if Id(Ti) ^ ^ Id(Pr) and every Tj is partially 

reduced wrt for j > i. A d-tri set is autoreduced if every Tj is reduced wrt 
for j > i. 

Let Q be any d-pol and U a finite set of nonzero d-pols in P. We define 

d-prem((5, T) := d-prem(. . . d-prem((5, ff), ... ,Ti), 
si(T) := {sep(P),ini(r*) | 1 < * < r}. 

The pair (T,U) is called a d-tri system if every d-pol in si(T) does not vanish on 
d-Zero(T/U), and an elementary triangular (e-tri) system if every polynomial 
in si(T) does not vanish on Zero(T/U). 

3.2 Decomposition into Regular Systems 

Let P and G be two d-pols in P \ /C of same class. A A-polynomial (A-pol) of 
P and G is defined as 



Z\(P, G) := sep(G)6»P - sep(P)0 G, (5) 

where 9 and (j) are proper derivative operators with lowest order such that 
ld(6»P) =ld(<^G). 

A d-tri set T is said to be coherent if, for any P, G S T with same class and 
any A{F,G) of the form Q, we have 



JA{F, G) = Ql6»iPi -b • • • -b Qr9rTr, 

where J is a product of d-pols in si(T), Qi G P, and Ti G T with 9iTi < ld(0P) 
for 1 < f < r. 

A d-tri system (T, U) is said to be coherent if T is coherent and each d-pol 
in U is partially reduced wrt T. A d-tri system (T,U) is said to be regular if it 
is a coherent and e-tri system. 
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Given a pair (P, Q) of d-pol sets (called a d-pol system) in TZ, one can decom- 
pose (P,Q) into finitely many regular d-tri systems (Ti,Ui), . . . , (Te,Ue) such 
that 

e 

d-Zero(P/Q) = |J d-Zero(Ti/Ui). (6) 

This can be done by using the characteristic set method of Ritt m and Wu 
or the elimination techniques of Seidenberg m further developed by 
Boulier and others puBuni. Besides their different elimination strategies, the 
two approaches are also distinguished by their ways of dealing with integrability 
conditions: the former uses passive ascending sets and integrability d-pols are 
computed by means of completion and M/I-reduction jl OlHiSj . while for the latter 
integrability conditions are handled with coherence and computed by means 
of Z\-pols mm- It is clear that the two methods and their ways of dealing 
with integrability conditions may be combined or interchanged. Some relations 
between passive and coherent d-tri sets have been established in HE]. 

The significance of regularity of a d-tri system lies partly in the following 
remarkable result pnj: if a d-tri system (T, U) is regular, then 

d-Zero(T/U) = 0 Zero(T/U) = 0. 

Moreover, by means of computing regular d-tri systems the problem of de- 
ciding whether a d-pol P vanishes on d-Zero(T/U) may be reduced to a similar 
problem for ordinary polynomials. To make this precise, we recall the saturation 
of (differential) ideals. Let J be an ideal and F a nonempty subset of TZ. The 
saturated ideal of 3 wrt F is 

3 : F°° := {P G TZ \ FP G 3 for some product F of d-pols in F}. 

3 : F°° is a differential ideal if 3 is a differential ideal, and that P vanishes on 
d-Zero(T/U) if and only if (iff) P belongs to the radical of [T] : 1[J°“. Thus the 
zero decomposition (|ED implies that 



e 

d-Zero([P] : Q“) = jj d-Zero([Ti] : U“). (7) 

The following result mm shows how to reduce the radical ideal member- 
ship problem from the differential case to the algebraic case. Let a d-tri system 
(T, U) be regular. Then, 

— a d-pol P belongs to [T] : iff a partial d-pseudo-remainder of P wrt T 

belongs to (T) : U°°; and 

— both [T] : 1U°“ and (T) : are radical. 



3.3 Irreducibility, Grdbner Bases, and Simple Systems 

In the decomposition (0 it is possible that d-Zero(Ti/Ui) = 0 for some i, and 
radical ideal membership cannot readily be tested without further computation. 
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However, in view of the results mentioned in the preceding section we now can 
work with ordinary polynomials over fC. There are several techniques that have 
been developed to deal with this algebraic problem deduced from d-pol systems. 

The first is to impose an irreducibility requirement, which was proposed by 
Ritt ^ and Wu . In this case, one further decomposes each regular d-tri sys- 
tem (Tj,lUi), considered as an e-tri system, into irreducible e-tri systems, where 
polynomial factorization over successive algebaric extension fields may have to 
be used. In the process of decomposition, an e-tri system that has no zero is 
detected automatically, and the obtained irreducible ones all have zeros. For any 
regular and irreducible d-tri system (T, U), a d-pol P vanishes on d-Zero(T/U) 
iff d-prem(P, T) = 0. Therefore, after a d-pol set P has been decomposed into 
regular and irreducible d-tri systems, whether a d-pol vanishes on d-Zero(P) can 
be tested simply by computing d-pseudo-remainders. 

The second technique, proposed by Boulier and others insj, works by com- 
puting a Grobner basis G of the saturated algebraic ideal (T) : U°° using a stan- 
dard method j0|. The e-tri system (T,U) has no zero iff G contains a nonzero 
element of /C. Otherwise, a d-pol P belongs to [T] : iff the normal form 

of a partial d-pseudo-remainder of P wrt T modulo G is 0. So, in this case 
the radical ideal membership may be tested by means of computing partial d- 
pseudo-remainders and normal forms. 

The third technique, suggested by Li and Wang HHI, proceeds by decompos- 
ing each regular d-tri system, considered as an e-tri system, into simple systems 
m- A simple system is an e-tri system (T, U), in which T is not necessarily 
irreducible but every polynomial of T is conditionally squarefree in some techni- 
cal sense. The essential process for this decomposition is to use the polynomials 
of T to eliminate polynomials from U by computing regular subresultants. The 
squarefreeness condition may be easily accomplished because of the regularity of 
the d-tri system. A simple system (T, U) must have zeros, and a d-pol P vanishes 
on d-Zero(T/U) iff d-prem(P, T) = 0. 

More recently, Hubert uni, Boulier and Lemaire have devised alterna- 
tive (and more specialized) algorithms for computing (normalized/simple) e-tri 
sets representing regular differential ideals. We shall propose another specialized 
algorithm in the next section. 

The above general setting of d-pols will be specialized in our study of surfaces 
with m = 2 and = d/dv, 62 = d/du. Speaking about partial derivatives in 
this case, we usually mean proper derivatives. 



4 An Algorithm for Ideal Decomposition 
of Regular Systems 

In this section we present a new algorithm for decomposing regular d-tri systems 
into d-simple sets. A coherent d-simple set T is a d-tri set that has d-zeros, and 
for which P G [T] : si(T)°° iff d-prem(P, T) = 0. The underlying idea of our 
method is similar to the one described in |5|, but our definition of d-simple 
sets is weaker than the definition of characteristic presentations introduced by 
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Boulier and others, and our algorithm avoids computing explicitly the inverse 
of some polynomials in an extension of 1C. Moreover, our method works in the 
general frame of positive dimension and we do not need to reduce the problem to 
a zero-dimensional one as in rmn . This may simplify the problem to be solved 
and make it clearer and thus is a point of theoritical interest. 

We say that a d-tri set T is a d-simple set if 

[T] : si(T)“ = {PgTZ\ d-prem(P,T) = 0}. 

Let (T, U) be a regular d-tri system. We are interested in computing an irredun- 
dant characteristic decomposition of [T] : 1[J°°, that is a finite family of d-simple 
sets Cl, ... , Cm such that 



[T] : = f| [C,] : si(C,)“ 

i=l 

and the sets of associated prime ideals of [C^] : si(Ci)°° form a partition of the 
set of associated prime ideals of [T] : U°°. 

This problem may actually reduce to a purely algebraic problem. To explain 
this, we have to introduce similar notions for algebraic ideals. Our main tool 
is the algorithm triangSplit given below. For any d-tri set T = (Ti, . . . ,Tj.), we 
define ini(T) := {ini(Ti) | 1 < i < r} and sat(T) := (T) : ini(T)°°. 

A d-tri set T considered as an e-tri set is called a proper triangular sefl if 
sat(T) = {P G 72. I prem(P, T) = 0}. An irredundant algebraic characteristic 
decomposition of an ideal U in 72 is a family of proper triangular sets Ci, . . . , Cm 
such that J = pl™ ^ sat(Ci) and the sets of associated prime ideals of sat(Ci) 
form a partition of the set of minimal associated primes of 3. For simplicity, we 
sometimes omit the adjective “irredundant.” 

The following result slightly generalizes Theorem 6.2 of m and shows that 
the problem can be treated in a purely algebraic way. 

Theorem 2. Let (T, U) be a regular d-tri system in 72. If (T) : sat(Ci) 

is an irredundant algebraic characteristic decomposition, then each Ci is a co- 
herent d-simple set, and [T] : = HiPj] • si(Ci)°° is an irredundant charac- 

teristic decomposition of [T] : U°°. 

Hubert EOi, Boulier and Lemaire have proposed algorithms for computing 
algebraic characteristic decompositions from coherent autoreduced sets. They 
have to work with zero-dimensional ideals in the main step of their algorithms. 
Our algorithm deals directly with triangular sets of positive dimension, using the 
techniques introduced by Kalkbrener HH and developed by the first author, and 
thus it avoids projecting the triangular sets to dimension zero and then lifting 
them up. 

^ A proper triangular set is usually called a regular set or regular chain (see, e.g., 
P2E1). We use the term “proper” because “regular” is used with a different meaning 
in this paper. 
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Let C be a proper triangular set and P a polynomial in 72.. We say that C 
is a simple set if the ideal sat(C) is radical. The algorithm split(C, P) presented 
in m provides two families Ci, . . . ,Cm and Cm+i) • ■ • jCp of proper triangular 
sets such that (see P) 

— ^sat(C) = -ysat(Ci) fl • • • fl Y^sat(Cp), 

— the sets of minimal associated prime ideals of sat(Ci) for all i form a partition 
of the set of minimal associated primes of sat(C), 

— Vi, 1 < i < m, P is not a zero divisor modulo sat(Q), 

— Vi, TO + 1 < i < p, P is zero modulo sat(Ci). 

Moreover, when C is a simple set, the are also simple sets and thus form an 
algebraic characteristic decomposition of sat(C). We then define invComp(C, P) 
as the set {Ci, . . . , Cm} and nullComp(C, P) as the set {Cm+i, ■ • • , Cpj. 

Let U C 72 and C be a proper triangular set. It follows clearly from the 
definition of invComp above that the algorithm invCompSet(C, U) below returns 
a set {Cl, . . . , Cm} of proper triangular sets such that 

A/sat(C) : U°° = i/sat(Ci) fl • • • fl i/sat(Cm)- 

Moreover, if sat(C) : is radical, then Ci, . . . ,Cm are simple sets forming an 

algebraic characteristic decomposition of sat(C) : U°°. 

invCompSet(C, U) 

P := |C} 

for P in U repeat 

^ := Ubg<? 

return S' 

Using these algorithms we devise our main algorithm triangSplit: 

— Input: (T,U) either a regular d-tri system in 72, or a d-pol system in 72 with 
T = 0; 

— Output: (Cl, . . . , Cm} a set of simple sets, which form an irreducible char- 
acteristic decomposition of (T) : 1U°°. 

triangSplit(T, U) 

if T = 0 then return |T} 
let T = (Ti, . . . , Tr) 

U := UUini(T) 

V := (P e U I ld(P) = ld(T,.)} 

U' ;= U\V 
^ := 0 

for C in triangSplit((Ti, . . . ,Tr_i),U') repeat 
B := C U {Tr} 

for A in invCompSet(B, V) repeat 
(p := (PU {A} 



return (p 
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Proof. We show the correctness of the algorithm. If T is empty, then (T) = (0) 
and the result is obvious. Now assume that T = {Ti, . . . ,Tf) with r > 0, and let 
T' = (Ti, . . . , Tr-i) and P = Tr- 

Let (T, U) be a regular d-tri system. By definition, ini(T)(a:) ^ 0 for any T G 
T and x G Zero(T/U). It follows that ini(T) does not belong to any associated 
prime ideal of (T) : and this justifies line 3 of the algorithm. Moreover, we 

have 

(T) : = (T) : : ini(P)“ 

= (T' U {P}) : U'°° : ini(P)°° : V°°. 

Let be the output of triangSplit(T',U'). One may easily verify that (T' U 
{P}) : 1[J'°° = (T' : U {P}) : U'°°, and by induction we obtain 

(T) : U“ = Pi (sat(C) U {P}) : ini(P)“ : V“ : U'°° 

CG<2>i 

= p sat(C U {P}) : V“ : U'“ 

CG<2>i 

according to Proposition 4.3.2 in 

Let X denote the indeterminates, other than ld(P), occurring in T. Let V 
be an associated prime ideal of sat(C U {P}). According to ^ (Theorem 4.3.6), 
Pn/C[X] is an associated prime of sat(C) and consequently an associated prime 
of (T') : U'°°. It follows that U' for any U' G U', and thus 

(T) : U“ = P sat(C U {P}) : V“. 

CG<2>i 

Let C G and {Ai,...,Am} = invCompSet(C U {P}, V). Since (T) : U°° 
is radical, so is the ideal sat(C U {P}) : and from the specification of 

invCompSet the family Ai, . . . , A^ is an algebraic characteristic decomposition 
of sat(C U {P}) : V°°. We thus deduce that 

(T) : U“ = P sat (A). 
ag^ 

This proves the correctness of triangSplit. The termination of the algorithm is 
obvious. 

Let (T, U) be a regular d-tri system and {Ci, . . . , Cm} be the output of our al- 
gorithm for (T, U). Then, P vanishes on d-Zero(T /U) iff P G [T] : 1[J°“ iff the par- 
tial d-pseudo-remainder of P wrt T belongs to (T) : U°° iff d-prem(P, C^) = 0 for 
all f = 1, . . . , m. Therefore, with o, whether a d-pol vanishes on d-Zero(P/Q) 
can be completely decided by decomposing the d-pol system (P, Q) into regular 
d-tri systems and then to d-simple systems and by d-pseudo-remainder compu- 
tations. 
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5 The Fundamental Coefficients of Snrfaces Revisited 

In this section, we apply Wu’s and other related methods based on zero de- 
composition of d-pol systems to study the fundamental coefficients of surfaces 
and their relationships. The application of these methods to deal with geometric 
problems that can be expressed algebraically by means of d-pol equations and in- 
equations is quite straightforward. For differential geometry in three-dimensional 
Euclidean space, sometimes we also need to handle inequalities which are de- 
fined over the field of reals. In this case, some alternative devices have to be 
adopted. For example, we shall use the weaker condition S 0 instead of the 
real condition 5 > 0 in the computation. 

Let us introduce an ordering 17 for the derivatives of xi ^ ^ as follows: 

Sl^S^Xk -< if either ii + 12 < ji + j’2, or ii~\- 12 = j\ + j'2 and k < I, or 

h + *2 = ji + j2, k = I and i2 < J 2 - This ordering is denoted by grlexA in the 
diffalg package of F. Boulier and E. Hubert jS|, which has been used together 
with the authors’ implementation of simple systems and the algorithm triangSplit 
for our experiments. 

5.1 Automated Rediscovery of Theorema Egregium 

To derive the representation of the Gaussian curvature K in terms of the first 
fundamental coefficients and their derivatives, we form a set P of d-pols corre- 
sponding to the equations in O and 0. Let the derivatives of E F ^ G < 
L < M ^ N and those of r„ ^ ^ n be ordered according to 17, and any 

derivative of E, F, G, L, M, N be ordered smaller than any derivative of r„, ry,n. 
Under this (admissible) ordering, a decomposition of the form 0 for the d-pol 
system (P, {5, M}) (e.g., using Rosenf eld_Groebner in diffalg without compu- 
tation of Grobner bases) contains only one element (i.e., e = 1). One may find 
that in the d-tri set there is a d-pol of the form 

g = 5{LN - m2) - /3, (8) 

where f3 is identical to the difference of the two determinants in Theorem ^ 
Therefore, the Theorema egregium of Gauss is now rediscovered automatically 
(under the condition M 0). Moreover, the d-pol set P may be decomposed into 
12 regular d-tri systems. It can be easily verified that g has d-pseudo-remainder 
0 wrt all the 12 d-tri sets and thus g vanishes on the zeros of all these d-tri 
systems. Hence, the Theorema egregium holds true as well in the case M = 0. 

5.2 Relations between First and Second Fundamental Coefficients 

In the process of computing integrability conditions with F = M = 0, Li HZ! 
discovered a polynomial of degree 4 in L, whose coefficients are d-pols in E 
and G, and thus concluded that the second fundamental form is algebraically 
determined by E, G and their derivatives. This d-pol consists of 434 terms and 
thus is quite complex. It is difficult to analyze its geometric meaning. Using 
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a slightly different way of elimination, we show how to derive a more general 
relation in a very compact form. This compact form consists of only 11 terms and 
thus provides us with some possibility to investigate its geometric significance. 
We shall clarify the statement of Li and give symmetric expressions for this 
special case. Moreover, a quadratic relation of L has also been obtained. 

Consider the case M = 0 . Then the Codazzi-Mainardi equations become 

Cl + = 0, 

C2 = N^ + Lri2-Nr^^ = 0. 



Let 7 = f 3 / 5 ] then @ implies that 

c = LN — 7 = 0. 

Now eliminate the variable N from ci and C2 using c by means of d-pseudo- 
di vision; we have 



Pi = d-prem(ci, c), p 2 = d-prem(c2, c). 

Next, compute the Z\-pol ps of pi and p2 (whose leads are and respec- 
tively). Finally, eliminating the derivatives and Ly from p^ using pi and p2 
by d-pseudo-di vision, we get 

r = d-prem(p3, {pi,P2)), 

which is a d-pol of degree 4 in its lead L and does not contain terms of odd 
degrees in L. Therefore, the following relation is derived: 

r = r^L'^ + r 2 L^ -b rg = 0, 



where 

T4 = - -fyF^2 + i{ri2)v, 

r2 = -[7^(Ci2)« + + WriiF^2 + lulv - 77 ™], 

ro = +2^^F^,r^2- 

In fact, the relation r = 0 can be derived automatically by triangularizing the 
d-pol set H = {ci, C2, c} with L, N, 7 and as variables. For this purpose, order 
the derivatives of F22 A F^^ ^ F^2 ^ ^12 ^ 7 according to fl and to be smaller 
than any derivative of L, which is ordered smaller than any derivative of N. 
Then the d-pol r as well as / given below appears in the process of decomposing 
H into regular d-tri systems. However, the complete decomposition of H could 
not be obtained due to the occurrence of very large d-pols. 

After and 7 are substituted by their corresponding expressions, the nu- 
merator of r, when expanded, is a d-pol consisting of 17 666 terms in if, F, G 
and their derivatives. In the case F = 0 , r simplifies to 434 terms, yielding the 
d-pol found by Li [T 7 j . 

From r = 0 and c = 0 , the following theorem may be established. 
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Theorem 3 . For any surface with M = 0 , either T4 = r2 = rp = 0 , or the 
second fundamental coefficient L is algebraically determined by the first funda- 
mental coefficients E, F, G and their derivatives. In the latter case, if L 0 , 
then N and thus the second fundamental form are also algebraically determined 
by E, F, G and their derivatives. 

Moreover, pseudo-dividing p\ by r, we obtain the following quadratic relation 
/ = d-prem(pi, r) = 7(/2T^ + /o) = 0, 

where /2 and /g are d-pols consisting of 46 and 43 terms, respectively, in 7,Tj'2> 
^22i ^12 their derivatives. Therefore, we have the following new theorem. 

Theorem 4 . For any surface with M = 0 , either 7/2 = 7/0 = 0 , or the square 
of the second fundamental coefficient L is a rational expression of the first fun- 
damental coefficients E, F, G and their derivatives, viz., 

t 2 _ fo 

~~Tr 

U 7/0 7^ 0 ; then N and thus the second fundamental form are algebraically 
determined by E, F, G and their derivatives. 

Note that / was first discovered from the triangularization process of decom- 
posing H. We realize that this d-pol can be easily obtained as the d-pseudo- 
remainder of p\ wrt r after we have already seen it. After substitution of the 
expressions of 7, FI2, etc., the numerator of f2L^ + fo, if expanded, is a very large 
d-pol in E, E, G and their derivatives. When F = 0 , the numerator of /2A^ -I- fo 
simplifies to 3 696 terms. 

5.3 The Case F ^ M ^0 

Now we come to the special case F = M = 0 . Then the d-pols r4,r2,rg can be 
written as 

T4 = -^{EGKGu^ - GKE„Gu - EKG.G^ - EGGM, 

Zh/ 

V2 = ~^{2EGK^EyGu - EG^K^Euv + G^K^E,,Ey - E^GK^Guv 

+ E^K^GuG^ - 2E‘^G^KKuv + 2E'^G^KuK„), 

E^GK^ 

ro = ^ {EGKEuv - GKE^E, - EKE.Gu - EGKM, 

where the Gaussian curvature simplifies to 

7 (EuGu + Ef EyGy -\- Gf E^y -\- Guu\ 

4 E 2 
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Theorem 5. For any surface with F = M = 0, one of the following holds: 



(а) GuK = 0; 

(б) GuK ^0 and ^ 



Ey 

~E 



G^ IG, 
G ^ K ’ 



(c) the second fundamental coefficient L is algebraically determined by the 
first fundamental coefficients E, G and their derivatives; if L 0, then N and 
thus the second fundamental form are also algebraically determined by E, G and 
their derivatives. 



In the case EyGyK 0, r^, V 2 , and vq can be written in the following 
symmetric form: 



V4 = - 

r2 = - 
ro = - 



GKGu f Guv Ey Gy Ky\ 

2 ~ ~E ~ ~G ~ ~K ) ’ 

{EGK)^\^EyGu (Eu\ (Gu\ 

2 [eg \e)^ j 

E^G^K^Ey fEuy Eu Gu Ky\ 

2 \Ey E G K ) 




Of course, if C 4 = 0 and V 2 0, then L is also algebraically determined by 
if, G and their derivatives. In the case r^ = V 2 = vq = 0, computation shows 
that in general there does not exist any polynomial relation among L, E, G and 
the derivatives of E and G, that follows formally from the equations dH)"®- 
Now an interesting question is how to geometrically characterize the three 
classes of surfaces. The first two cases, which were not considered in Li’s conclu- 
sion, are not trivial. For example, we have looked at the following three families 
of surfaces. The first is the general surface of revolution 



r{u,v) = (h(u) cos u, / i(m) sin u, fc(u)), 

where it is assumed that h'^ + k'^ yf 0 and h yf 0. The surface includes the sphere 
with h{u) = acosu and k{u) = asinu, the ellipsoid (Fig. 2) with h{u) = acosu 
and k{u) = bsinu, and the torus (Fig. 3) with h{u) = a + bcosu and k{u) = 
bsinu as special cases. For this surface, we have Ey = Gy = Ky = Gyy = 0, so 
it belongs to class (b). 




Fig. 2 



Fig. 3 
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Another surface is the helicoid (Fig. 4) 

r{u, v) = (v cos u, —V sin u, bu), 

which is not a surface of revolution. For 
this surface, G„ = 0 and thus case (a) 
holds. 

Obviously, all surfaces of zero Gaus- 
sian curvature (i.e., K — 0) also belong to 
class (a). The cylinder (Fig. 5) is one of 
such surfaces. Moreover, for all the above 
examples we have r 4 = r 2 = cq = 0. 

On the other hand, it may be verified that n = 0 (i = 4, 2, 0) and fj = 0 
{j = 2,0) are not formal consequences of the equations (Q“®- However, this 
fact does not imply that there must exist surfaces such that 0 or /j 0. The 
reason is that the relations between the fundamental coefficients E, F, G, L, M, N 
and the vectors r„, r^, n expressed by means of the inner product are not taken 
into account in the hypothesis. To include these relations, we need to take the 
components of the vectors r^, and n as variables. This leads to a considerable 
increase in the number of variables and of equations and thus makes the com- 
putation much more complex. We have proved that, in the case F = M = 0, the 
d-pols r 4 ,r 2 ,ro do not belong to the radical of the differential ideal generated 
by the d-pols obtained from and the relations between E, G, L, N and 

ru,Ty,n hy taking the components of the involved vectors as variables. We have 
also observed that the regular d-tri systems whose d-zeros do not make the van- 
ishing of T 4 , T 2 , tq represent some nontrivial components of the zero set. So there 
should indeed exist nontrivial surfaces that belong to class (c). The remaining 
problem is to find such surfaces and their geometric characterization. 




6 Proving Theorems about Surfaces 



As shown in the preceding section, deriving an unknown geometric relation or 
discovering a new theorem may be done by triangularizing the geometric hy- 
potheses with specially arranged variable (and term) orderings. The case of prov- 
ing a geometric theorem is relatively easy because the conclusion-relations are 
given. Theoretically, the theorem may be proved by verifying that the conclusion- 
d-pols vanish on the d-zeros of the d-pol set P expressing the geometric hypothe- 
ses. However, this theoretical approach does not work well because geometric 
theorems are true usually under certain nondegeneracy conditions. It is a good 
strategy to introduce d-pol inequations to rule out some of the degenerate cases. 
Thus, in practice we should formulate the hypothesis of the geometric theorem in 
question as a d-pol system (P, Q) and decide on which part of d-Zero(P/Q) the 
conclusion-d-pols vanish. This can be done by decomposing the hypothesis-d-pol 
system so that its zero set is split into subsets (corresponding to the geometric 
situations), and then deciding on which subsets the conclusion-d-pols vanish. 
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We have applied the zero decomposition techniques reviewed in Sect. 0to 
d-pol systems formulated from problems in the local theory of surfaces, experi- 
menting with several theorems selected from standard textbooks of differential 
geometry and the literature of automated geometric reasoning to see the effec- 
tiveness of differential elimination methods. The following examples (of which 
the first is taken from El) are two of them and serve to illustrate how such 
theorems can be proved automatically. 

Example 1. If a surface r = 
r{u,v) has two unequal con- 
stant principal curvatures, then 
the surface is a cylinder. 

With the formulation given 
by Li [HI, the hypothesis of this 
theorem consists of the follow- 
ing seven d-pol equations 

Pi = LuE - LEu = 0, p2 = LyE - LEy = 0, ps = N^G - NGy = 0, 

P 4 = NyG - NGy = 0, p5 = 2 NyGE - NGuE - LGyG = 0, 

Pe = A NLGE + 2 G„„G'E - GyEyE - GlE - GyGEy + 2 GEyyE - GE^ = 0, 
P7 = 8 LyLG^E - 4 N^G^Ey + 2 GuuGEyE - GyE^E - GlEyE - GyGEyEy 
+ 2GEyyEyE-GEl = Q, 

and the inequation q = EN — GL ^ 0 (which means that the two principal 
curvatures are unequal). Let P = {pi, . . . ,pt}, Q = {E, G, q}, and the derivatives 
oi E < G < L < N he ordered according to Q. Using the refined Seidenberg 
algorithm, the d-pol system (P, Q) may be decomposed into two regular d-tri 
systems (Ti,Ui) and (T 2 ,U 2 ) with 

Tl = [N, Ey, Gy, ELy ~ LEy, ELy ~ LEy], 

T2=[L,Gy, GNy - NGy, 2 EGNy - ENGy - GLGy, 

— 2 EGEyy GE'^ EEyGy — 4G^L^], 
l5i = {E,G,L}, U2 = {E,G,iV} 

such that (0) holds with e = 2. Considering the two d-tri systems as e-tri systems 
and decomposing the first under N Ey Gy < Ly Ly (the ordered leads) 
and the second under L Gy Ny Ny Eyy, we may get two simple 
systems (Ti,0) and (¥ 2 , 0 ). Therefore, we have 

d-Zero([P] : Q“) = d-Zero([Ti] : V“) U d-Zero([T 2 ] : V“), (9) 

where is the set of factors of the d-pols in si(Ti): Vi = {E} and V 2 = {E, G}. 
By means of computing Grobner bases as proposed by Boulier and others |:il4| . 
one may also get two simple sets 

T 4 = [N, Ey , Gy , Ly , ELy ~ LEy] , 

T* = [L, Gy, GNy - NGy,Ny, 2 EGEyy ~ G E^ - EEyGy] 




Fig. 5 
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such that holds as well when is substituted by T* for i = 1, 2. 

The conclusion of the theorem to be proved may be given by five d-pol equa- 
tions (see El)- According to the reduction techniques mentioned in Sect. IT, SI 
one may easily verify that the conclusion d-pols can be reduced to 0 wrt Ti or 
Gi, but not wrt T 2 or G 2 . In other words, the theorem is proved to be true 
for one of the two components. The reason is that the conclusion is formulated 
only for one of two cases shown in Fig. 5. For the other case, we need to swap 
the positions of u and v in the conclusion d-pols as remarked by Li 1171 . The 
resulting d-pols may be reduced to 0 wrt T 2 or G 2 . In any case, the surface is a 
cylinder under the condition EG 0; this condition is already satisfied because 
EG > 0. 

Example 2. If a surface r = r{u, v) consists entirely of umbilics, then it is 
planar or spherical. 

A point P on the surface is called an umbilic if the two principal curvatures 
at P are equal. At every umbilic we have 

L _ M _ N 
E~ Y ~ G' 

Take the d-pol equations (0^(0 together with EM — FL = 0 and EN — GL = 0 
as the hypothesis of the theorem, denote the corresponding set of d-pols by P, 
and let Q = {E, G, <5}, where S = EG — F'^ as before. Order the derivatives of 
E<F<G<L^M^N<r<n according to 17. 

If L = 0, then it is easy to decompose (P U {L},Q) into one regular d- 
tri system (T, U), and one may see that both n„ and ny are contained in T. 
Therefore, n is a constant vector and the surface is planar. 

Now consider the case L yf 0. Then decomposition of (P, Q) as of the form 
(El yields two regular d-tri systems (Ti,Ui) and (T 2 , 1 U 2 ), where 



— (Ti, . . . , Tio), 
Ui =QU{F}, 



T 2 = (F, M, T2, Tg, Ti, Ts, Tg, T', T', T', r(g), 
V2 = {E,G}, 



Ti = EM - FL, 

T2 = EN- GL, 

Tg = ELy — LEy, 

T4 = E'^Ly — ELEy — EFLy + LFEy, 

Tg = Efly LVy, 

Tq = Efly LVy, 

Tr = E^ [2 S{Gyy - 2 + F„„) - EGl - GE^ + 4 

+ (2 FFy - GEy + FEy)Gy T (2 EFy - FEy - EEy)Gy 
- 2 (2 EFy - GEy - FEy)Fy\ - 4 (2 FG - F^)F'^L^, 

Ts = E[2 Sryy + {GGy + FGy-2 GFy)ry - {FGy + EGy-2 FFy)ry] 
-2LGSn, 
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Tg = E[2 Sr^v + {FGu - - {EGu - FE^r,] - 2 LFSn, 

Tio = 2(5t’„„ + (2FFu — GEu — FEy)ru ~ (2-Ei^u — FE^ — EEy)ry 
-2L5n-, 

= ELu — LEu, 

T' = 2 EG{Gu^ + E,y - EGl - GE^ - GE^G^ - EE,G, + 4 G^L\ 
T^ = 2 EGr^^ + GGuTu ~ EG^r^ - 2 G^Ln, 

T^ = 2 EGruv - GEyru - EGuVv, 

T[q = 2 EGvuu — GE^Tu + EEyTy — 2 EGLn. 

In fact, Ti is a simple set and T 4 may be replaced by T'^. Let 



Then one can easily verify that 



d-prem(fc„,Ti) = d-prem(fc„, Tj) = d-prem(/„, Tj) = prem(/„,Ti) = 0 



for i = 1,2. It follows that fc is a nonzero constant and f is equal to a constant 
vector rg, and thus 

n 

r-ro = --. 



Therefore, we have 



|r - ro\ 



M = J_ 

|fc| |fc|- 



This proves that the surface is spherical. 



For differential geometry, the algebraic formulation of problems is not al- 
ways easy and straightforward. We are looking for more geometric theorems 
that are nontrivial and instructive, may be formulated in the setting of d-pol 
equations and inequations, and are computationally tractable. We have also been 
observing the performance of decomposition methods with different variants and 
the computational difficulties involved for theorem proving. Not as easy as in 
elementary geometry, we often encounter very large d-pols when dealing with 
problems about surfaces. The computation is highly sensitive to the variable and 
term orderings. How to design more efficient decomposition algorithms and how 
to apply them effectively to attack problems in the local and global theory of 
surfaces have become interesting questions for our research. 
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Abstract. We discuss algorithmic steps when dealing with realizability 
problems in discrete geometry, especially that of finding realizations for 
a given oriented matroid. After a brief introduction to known methods, 
we discuss a dynamic inductive realization method, which has proven 
successful when other methods did not succeed. A useful theorem in this 
context in the rank 3 case asserts that a one-element extension of a uni- 
form rank 3 oriented matroid depends essentially just on the mutations 
involving that element. There are problems in computational synthetic 
geometry of course, where intuition must help. In this context we men- 
tion the application of the software Cinderella to automated deduction in 
computational synthetic geometry, when studying face lattices of poly- 
topes. 



1 Introduction 

We start with a motivation for studying oriented matroids. When using an 
(n X r)-matrix to describe a geometrical object in Euclidean space, we are actu- 
ally often interested in the equivalence class of all matrices that describe images 
of our object under rigid motions. Oriented matroids form a natural framework 
for such equivalence classes. They are even invariants of the corresponding pro- 
jective space with respect to homeomorphic transformations. For the novice in 
the theory of oriented matroids, we recommend to think of an oriented ma- 
troid with n elements in rank r as an equivalence class of (n x r)-matrices. In 
each such equivalence class we collect matrices that agree on certain combina- 
torial properties, about the relative position of the row vectors of the matrix. 
So, in principle, oriented matroids are purely combinatorial objects that mimic 
the “relative-position behavior” of vectors. While real matrices provide a kind 
of realizable paradigm for oriented matroids, there are, moreover, also oriented 
matroids that do not come from a matrix (still sharing the same abstract com- 
binatorial properties) . They are called non-realizable. One model of an oriented 
matroid with n elements in rank r describes it as an abstract sign vector with 
(") components. The components are indexed by the (") r-tuples (Ai, A2, . . . , A,.) 
with 1 < Ai < A2 < . . . < Aj. < n and in case of an (n x r)-matrix Ai, n > r, the 
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component corresponding to (Ai, A 2 , . . . , Aj.) equals the sign of the determinant 
of the sub-matrix of A4 with rows Ai, A 2 , . • . , A^. While all realizable oriented 
matroids can be generated in this way, a general oriented matroid is such a sign 
vector that satisfies certain conditions of local realizability. We omit the technical 
description of these criteria here and refer to 0 for an elaborate treatment of 
oriented matroid axioms. 

One essential property of oriented matroids is that many geometric questions 
that are usually treated on the coordinate level can be dealt with as well on the 
combinatorial level of a suitably chosen oriented matroid. For instance the convex 
hull of a point configuration P can be calculated from the oriented matroid 
of the matrix of the homogeneous coordinates. Similarly this oriented matroid 
carries enough information to decide whether a given simplicial complex can 
be embedded without self-intersections on the vertices of P. This enables us to 
break up the process of deciding the realizability of a simplicial complex A into a 
two-step procedure. First one enumerates (on a combinatorial level) all oriented 
matroids that are compatible with A. Then one tries to realize at least one of 
them. So, in principle a simplicial complex may be non-realizable for two different 
reasons. Either no oriented matroid was found (this is a purely combinatorial 
statement) or the possible oriented matroids turned out to be non-realizable 
(this is a geometric statement). 

The notion Computational Synthetic Geometry has been introduced in 
It deals with realizability problems in discrete geometry, see also HH. An es- 
sential subproblem in this field is that of finding realizations for given oriented 
matroids (i.e. given the sign vector, find a matrix that has this vector as oriented 
matroid) . By observing that determinants are a special kind of polynomials, the 
realizability problem turns out to be a semialgebraic problem of finding solu- 
tions of a system of real polynomial equations and inequalities. A first naive 
attempt to the realizability problem would therefore try to apply the more or 
less standard techniques and algorithms of real algebraic geometry. However the 
general complexity behavior of this problem is known to be intrinsically hard 
and a general algorithm is far from being applicable for practical purposes, see 
mg. One might think that there is some hope that the realizability problem 
for oriented matroids turns out to be simpler than general semialgebraic prob- 
lems, since determinants are very special polynomials. However it turns out that 
this is not the case: the universality theorems of Mnev HH show that the real- 
izability problem and the problem of solving systems of polynomial inequalities 
are essentially equivalent. See also the closely related results about realization 
spaces of polytopes of Richter-Gebert [zq. On the other hand, several heuristic 
methods for deciding the realizability of oriented matroids have been applied 
successfully in the past. Starting with a brief introduction to the known meth- 
ods, we discuss an additional method, a dynamic inductive realization method, 
which has proven successful when other methods did not succeed. We provide 
in particular a useful theorem in this context in the rank 3 case which asserts 
that the one-element extension of a uniform rank 3 oriented matroid depends 
essentially just on the mutations (see below) involving that element. 
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There are problems in computational synthetic geometry of course, where no 
formal heuristics did provide a solution and intuition must help. In this context 
we mention the application of the software Cinderella for automated deduction 
in computational synthetic geometry, when studying face lattices of polytopes. 

We start with a typical example in computational synthetic geometry: the 
realization problem for a triangulated 2-manifold. We exemplify how to proceed 
algorithmically. A fundamental subproblem will be that of finding coordinates of 
a given oriented matroid. As mentioned above we know that a general algorithm 
for this problem is unsuitable for practical applications. So we have to look at 
fast heuristic methods instead. 

Example 1. We consider the following abstract list of triangles, an orientable 
triangulated 2-manifold: 

125 127 137 138 146 148 156 236 238 245 248 267 345 347 356 467 
Input: A triangulated combinatorial torus. 

Problem. Can we find 8 corresponding points in Euclidean 3-space such that 
the flat triangles defined via the above list have no self-intersections? 

Step 0. Finding an admissible oriented matroid. 

Assume for a moment that we have found such 8 points, and write their homo- 
geneous coordinates as an 8 x 4 matrix A4. One can compute the signs of all 
determinants of 4 x 4 submatrices of At. The sign of such a quadruple tells us 
whether the corresponding four points form a left- or a right-handed system in 
space. When we consider a collection of 5 rows of this matrix At corresponding 
to the vertices of a triangle and a line segment of our example, the corresponding 
5 signs of determinants of 4 x 4 submatrices tell us whether the edge pierces the 
triangle or not. We ask the reader to confirm this. 

Having this is mind, we look for a sign structure that satisfies two properties. 
On the one hand it should have the chance to be the sign structure of the 
determinants of 4 x 4 submatrices of a matrix A4 (i.e. we look for an oriented 
matroid) and on the other hand, the sign structure should not violate any of the 
intersection properties that are forced by the simplicial complex. In other words, 
we determine an admissible oriented matroid in rank 4. We skip here the method 
for generating them. An effective algorithm can be found in 0. The output of 
the algorithm can be one of two possibilities: 

Either: The set of admissible oriented matroids is empty. 

If this were the case, it would be impossible to embed the example under 
consideration. 

Or: Admissible oriented matroids were found. 

If this were the case, the embedability of the example depends on the question 
whether at least one of these oriented matroids is realizable. If there is more than 
one admissible oriented matroid, then the torus is not embeddable if and only if 
all of them are non-realizable. 
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In our torus case, we have found the following admissible oriented matroid 
with 8 elements in rank 4 written in terms of signed bases. 



-1234 -1235 -1236 -1237 +1238 +1245 +1246 +1247 -1248 +1256 
+ 1257 -1258 +1267 -1268 -1278 -1345 -1346 -1347 +1348 +1356 
+ 1357 +1358 +1367 +1368 +1378 -1456 -1457 +1458 -1467 +1468 
-1478 +1567 -1568 -1578 -1678 +2345 +2346 +2347 +2348 +2356 
+2357 +2358 +2367 +2368 +2378 +2456 +2457 -2458 +2467 -2468 
-2478 -2567 -2568 -2578 -2678 +3456 +3457 +3458 +3467 +3468 
+3478 -3567 -3568 +3578 +3678 -4567 +4568 +4578 +4678 -5678 



Now the problem reduces to finding coordinates of points that generate exactly 
the sign pattern of the oriented matroid. Any such realization will automatically 
be an embedding of the torus under consideration. 

We use the following method when the problem is small enough (i.e. when we 
can be optimistic about finding a decision for the forthcoming inequality sys- 
tem). The advantage of the following method is that we do not give up the full 
generality. Nevertheless, we can also start with any other alternative mentioned 
later. 



Step 1. Careful analysis to determine the unit matrix. 

Since the oriented matroid of our matrix M is invariant under multiplication by 
a 4 X 4 matrix with positive determinant, we can assume that the rows 1, 3, 5 
and 7 form a unit matrix. In our oriented matroid the sign [1, 3, 5, 7] is positive, 
which is consistent with the sign of our chosen basis. If this were not the case we 
would have w.l.o.g. to realize the oriented matroid where all signs are reversed. 



M = 



/I 0 0 0 \ 
A B C D 
0 10 0 
E F G H 
0 0 10 
I J K L 
0 0 0 1 
\M nop) 



/ 1 0 0 
a b c 
0 1 0 
-e -/ -g 
0 0 1 
—i—j k 
0 0 0 
y —m —n —o 




1 ) 



Having chosen certain rows to be a unit matrix, considerably simplifies the 
algebraic structure of the system of inequalities. While we before had to deal 
with a system of many 4x4 determinants, now (due to all the zeros in the 
matrix) many determinants can be expressed as 1x1, 2x2, or 3x3 determi- 
nants of suitable sub-matrices. In particular the variables A ... P themselves 
can be expressed as a determinant (e.g. P = [1,3, 5,8]). By this the oriented 
matroid itself determines the signs of the variables. The signs of all variables 
are determined when we know the signs of the following determinants: [2357], 
[1257], [1237], [1235], [3457], [1457], [1347], [1345], [3567], [1567], [1367], [1356], 
[3578], [1578], [1378], [1358]. 
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Now we substitute all negative variables v with —v in order to have positive 
variables in A4' only. In what follows all calculations refer to the matrix Ai' , in 
which the variables a, ... ,p have to be positive in order to get a realization of 
our oriented matroid. 



Step 2. Finding a minimal reduced system. 



The selection of the rows that are chosen to be a basis is not at random. The 
arguments that lead to this particular choice will be discussed now. See e.g. 0 
for a more detailed example. 

It is a well known fact that the values of sub-determinants in a matrix are not 
independent from each other. They adhere to algebraic dependencies; so called 
GraBmann Pliicker relations (for instance we have [1357] [1368] — [1356] [1378] -I- 
[1358] [1367] = 0, as you can easily verify for the matrix Ai'). Not all terms in 
such an equation can be positive at the same time. This implies that already 
a subset of signs of determinants determines all of them. For a given choice of 
a unit matrix and a given oriented matroid we can look for a “small” reduced 
system: a subset of determinants whose signs determine all other signs such that 
a weighted sum of the degress of the corresponding polynomials is as small as 
possible. In our example the weights should be chosen in a way such that the 
1x1 submatrices (i.e. the variables) are “for free”, 2x2 are “cheep”, 3x3 are 
“expensive” and 4x4 are “very expensive.” 

We skip arguments in which GraBmann Pliicker relations are used for finding 
a minimal reduced system. We are finally left with the following subsystem of 
inequalities. 



cp 


< 


do 


[1238] 


bh 


< 


df 


[1245] 


bo 


< 


cn 


[1278] 


gn 


< 


fo 


[1478] 


di 


< 


al 


[2356] 


aj 


< 


bi 


[2567] 


Im 


< 


ip 


[3568] 


/* 


< 




[4567] 



f -gh 
j k I 
n —o p 



< 



0 



a c d 
e gh 
mop 



< 0 



[1468] 



[2348] 



So far we did not loose any generality. Every solution of the above system of 
10 equations together with the positivity restriction for the variables provides a 
realization of the oriented matroid. The choice of the rows that constitute the 
unit matrix is such that this system of inequality is as cheap as possible with 
respect to our weight function. 

Step 3. Starting the solvability sequence method (for details see H3l b 



First we observe that we can choose k very large: This variable occurs only in 
the inequality [1468] < 0. This determinant can be written as [1468] = k ■ {fp — 
hn) -!-••• = —k ■ [1458] -!-••• such that k does not occur in the remaining terms. 
The oriented matroid tells us that [1458] > 0, hence the choice 



implies that the term containing k dominates all others and we get [1468] < 0 
(under the new side condition [1458] > 0). 
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We eliminate k by this argument and add the euqation fp < hn to our 
system. Now we only have to consider the 2x2 determinants and the last 3x3 
determinant. Next we analyse the signs of the gradients for each variable in each 
inequality. We can do this by similar signature arguments that we used for the 
elimination of k. 



a m b lit c d 

e / itt 5 h 

t Uttj it k ; it 

m iJJ. n it o ittJIP itlT 



Here a i or IJ. means that there is an equation that tells us that the corre- 
sponding variable should be chosen “small” in order to satisfy the inequality. 
(Gradients that come from a 2 x 2 matrix are marked by i, and gradients that 
come form a 3 x 3 matrix are marked by 1|. Siminarly for f and fl'.) 

If we consider the variable g we see that making this value positive and close 
to zero improves all inequalities in which it is involved. Moreover the equation 
gn < fo will be automatically satisfied if g > 0 is chosen very small for any fixed 
positive choice of n, f and o. Thus a choice 5 — >■ 0 eliminates this variable 



and the inequality [1478] < 0 from our considerations. Next we can apply the 
same argument to m and get m —>■ 0. Having eliminated the inequality [1478] < 0 
variable n only remains in 2 x 2 inequalities on the right side. Choosing it large 
enough we can eliminate n and the corresponding equations. Now we skip all 
details that help us to proceed iteratively in essentially the same manner. We 
only list the resulting sequence of variables. 



bo fp ^ „ 

c > h ^ 

M 1 

a 


i = 1 

- < j < - 

e a 


6=1 
d ^ J ^ a 




a=c=d=e=p=l 


h = 0.5 


0= 1.2 





It is important to mention that the whole procedure we sketched can be defined 
in a rigurous algorithmic way (see jSl)- It can essentially be carried out on a 
completely combinatorial level. Going all the way back, we find that 



1 


0 


0 


0 


1 


1 


1 


-1 


0 


1 


0 


0 


-1 


-0.6 


-0.2 0.5 


0 


0 


1 


0 


-1 


-0.8 


2 


1.5 


0 


0 


0 


1 


-0.5 


-2 


-1.2 


1 



is a possible realization of the oriented matroid: A solution was found. 
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A contradiction would lead to a final polynomial (see El)- 
What can we do, when we get stuck with our system of inequalities? 
Alternative 1. Trying to find a bi-quadratic final polynomial (see EH)- 
Alternative 2. Trying a combinatorial reduction method (see m)- 

In an investigation of finding the set of all non-realizable uniform 10 element 
oriented matroids in rank 3 in jO] , our Alternative 1 and Alternative 2 was applied 
successfully but a final set of about 143 different cases remained undecided. This 
was the birth of Alternative 3 suggested by Bokowski which will be described 
below. The method started with a rubber band model. It was implemented by 
K.P.Pock, support was given by J. Richter-Gebert in the rank 3 case. There is 
also a later implementation by J. Scharnbacher in the rank 4 case. 

Alternative 3. Trying the dynamic inductive realization method (see details in 
this article). 

Alternative 4. There is always hope for some intuition, compare our last sec- 
tion. 

For our dynamic inductive realization method, we have now to introduce 
more notations. 

2 Configurations and Arrangements 

We use the notation of a recent paper [HI in which a direct proof of the equiv- 
alence of the hyperline sequence representation and the Folkman Lawrence rep- 
resentation of an oriented matroid in the rank 3 case was established. Since we 
use in this article both models and their interaction, it is useful for the reader 
to have a look at that paper. We include the following definitions especially for 
the reader not familiar with the theory of oriented matroids. 

We introduce several configurations and arrangements representing geomet- 
rically a class of matrices. It is useful to think simultaneously of all these models 
and to pick the most convenient one for a particular application or argument. 

We consider a non-degenerate vector configuration in R^, i.e. a finite ordered 
set Vn = {vi, . . . ,Vn} C R^, n > 3, Vi ^ 0, i = 1, . . . ,n, such that the one 
dimensional subspaces generated by Vi, i = 1 , . . . ,n, are pairwise different and 
such that the corresponding n x 3 matrix M with Vi as its i-th row vector has rank 
3. The vector configuration will be viewed as a representative of the equivalence 
class of matrices c/„(M) := {M' \ M' = D M, D = diag{Xi, X 2 , ■ ■ ■ , A„), Ai > 
0 ,* = 

A vector configuration Vn induces an arrangement of oriented central planes 
Hn = {h\, ■ ■ ■ ,hn}, via the concept of polar duality. The unoriented plane of 
hi is given as the zero space {x = (xi, 0:2, 2:3) G R^\hi{x) = 0} of a linear 
form hi{x) = -I- Vi.^X 2 + Vi^^xs, Vi = {vi^,Vi.^,Vi,^) 7^ 0. The positive and 

negative sides of an oriented central plane are the two induced half-spaces hf : 
{a; I hi{x) > 0} and h~ : {x\ hi{x) < 0}. 
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An arrangement of oriented central planes Hn induces an arrangement of 
oriented great circles C„ = {ci, . . . , c„} on the 2-sphere and vice-versa. An ori- 
ented central plane cuts the unit sphere S'^ in along a great circle which we 
consider to be parameterized and oriented such that, when looking from outside, 
the positive half-space lies to its left when the parameter increases. 

A vector v ^ 0,v £ induces a directed line ly : {av\a G R} through the 
origin, which intersects the sphere in two antipodal points Sy (in the direction 
of v) and (in the opposite direction). A vector configuration V„ induces a 
configuration of points on the sphere, S'„ = {si, S 2 , ■ • • j Sn}> where Si = Sy^,i = 
1, . . . , n. Each point p on the sphere has an associated antipodal point p. 

We carry over the previous polar dual pairs to the affine plane T, viewed 
as a plane tangent to the 2-sphere. We assume that Vi,i G {1, . . . ,n} is neither 
parallel nor orthogonal to the plane T. 

The great circle parallel to T defines two open hemispheres. One of them, 
called the upper hemisphere, contains the tangent point of T. An oriented great 
circle Ci induces an oriented half-circle in this upper hemisphere which projects 
to an oriented straight line l'^{ci) in the plane T via radial projection, and vice- 
versa, any oriented straight line in T defines an oriented great circle on 5^. An 
arrangement of oriented great circles induces an arrangement of oriented lines 

= {^1) • ■ • )^n}, where k := i^{ci), in the affine plane. 

The same transition from the sphere to the plane T leads from a point 
configuration on the sphere to a signed point configuration in the affine plane. 
We define sp^(sj) to be a pair of a signed index and a point Pi £ T obtained via 
radial projection from Si, as follows. A point Si on the upper hemisphere maps to 
the pair sp’^{si) = (i,Pi), i £ {!,..., n}, and a point Si on the lower hemisphere 
maps to a pair (f,p^) and pj := pi £ T. We obtain from S'„ = {si, . . . , s„} a 
signed point configuration = {spi, . . . , spn}, with spi := sp^(sj), and vice 



versa. 
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3 Hyperline Sequences of Configurations 
and Arrangements 

We use En = {1, . . . , n}, endowed with the natural order, to denote the index set 
of geometric objects like vectors, planes, great circles and points on the sphere, 
lines and points in the Euclidean plane, or of a finite ordered set of abstract 
elements. The associated signed index set E„ = {1, . . . , n, 1, . . . , h} makes it 
possible to denote orientations or signs of these elements. The s i— s operator is 
required to be an involution: s = s. Vs G E„. 

All ordered sets E„, C„, L^, Pj above can be viewed as geometric rep- 

resentations of the same equivalence class of matrices c?„(M). We can reorient 
the elements. The reorientation classes are the equivalence classes with respect to 
reorienting subsets such as vector configurations or central plane arrangements, 
great circle arrangements or pairs of antipodal points on the 2-sphere, line ar- 
rangements or point sets in the plane. These reorientation classes are obtained 
when the numbers Ai yf 0 can be negative as well. The reorientation of a vector 
Vi is the vector vj = —vt and the reorientation of an oriented central plane is the 
change of the sign of its normal vector. The reorientation of an oriented great 
circle or of an oriented line means replacing it by the same object with the re- 
versed orientation. The reorientation of a signed point {i,Pi),i G P„ is the signed 
point {i,pj),pi = pj. The reorientation of an index i is its replacement with i. 
The relabelling of an ordered set is given by a permutation of its elements. 

We now extract combinatorial information from all the geometric sets defined 
above. We will work only with signed subsets q C E„ which do not contain 
simultaneously both an element i and its negation i. li q G P„, we define q = 
{s|s G q}. The unsigned support supp{q) C P„ of g C P„ is obtained by ignoring 
all the signs in q. A signed partition of P„ is a signed set I = /+ U /“ with 

/+,7^gp„, i+uJ^ = e„. 

Definition 1. A hyperline sequence hsi over E„,i G E„ with half-period length 
ki is a pair hsi = (i,TTi), where Tii is a double infinite sequence tt* = (qj)j^z with 

q] C En\ {i,i}, q] = q]+kp ^ supp{[j-^z g]) = P„ \ supp{{i}), where the 
unsigned supports of q{, . . . ,q\. are mutually disjoint. We consider hsi = (f,7ri) 
and hsj = (*,7ry) to he equivalent when TTi is obtained from ttj by reversing the 
order. 

The name hyperline for a subspace of codimension 2 is justified by the concept 
in higher dimensions. In the particular case when all the g*’s are one-element 
subsets, the sequence is said to be in general position, simple or uniform, and 
we replace the sets g* with their elements. In this case, any half period of tt^ 
is a signed permutation of E„ \ .supp{{i}). In general we have an additional 
ordered partition into pairwise disjoint subsets of the signed elements. An infinite 
sequence TTi in a hyperline sequence hsi = (i,TTi) can be represented by any half 
period, i.e. by any ki consecutive signed sets ql_^_l, . . . , dl+ki^ C En\{i, i}, t G 
Z. 
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Example 2. (I^y) = (1: (• • • : {5}, {2, 4}, {3}, {5}, {2, 4}, {3}, . . .)) is a hyperline 
sequence over E^ = {1 . . . 5}, with half period length ki = 3. 




We obtain the normalized representation hsr = (r, tt ^) of a hyperline se- 
quence hsi = (ijTTj) by first choosing (r, tt^) := (i,7Ti) if i G E„ or (r, tt^) := 
(i,reverse(7Ti)) if i G En, and afterwards choosing the half period of tt^ starting 
with the set gj C containing the smallest positive element. 

Example 3. The normalized representation of the hyperline sequence in the 
previous example is (1, ({2, 4}, {5}, {3})). From now on, we will use the more 
convenient notation (1 : {2,4}, {5}, {3}). 

To a signed point configuration = {{i,Pi) \ i G 1} (obtained from a vector 
configuration as described above) we associate a set ElS{Pff) = {hsi, . . . 
of n hyperline sequences hsi = (i,TTi) over En- The sequence tti, denoted by a 
half period q\, q\, . . . ^ q],,, with g* C En \ {i, i}, corresponds to the signed point 
{i,Pi) G Pn ■ It is obtained by rotating an oriented line in ccw order around pi if 
i G En or in cw order around pi if i G En and looking at the successive positions 
where it coincides with lines defined by pairs of points (pi,pj) with pj ^ pi. 
When Pn is not in general position, several points may become simultaneously 
collinear with the rotating line, and they are recorded as a set g}. If the point Pj 
of the signed point (j,Pj) is encountered by the rotating line in positive direction 
from Pi, it will be recorded as the index j, otherwise as the negated index j. The 
whole sequence is recorded in the order induced by the rotating line, and an 
arbitrary half-period is chosen to represent it. 

Definition 2. The rank 3 oriented matroid induced by hyperline sequences as- 
sociated to a signed point configuration P^ = {{i,Pi)\i G I}, where I is a signed 
partition of En, is HS{Pn) = {hsi = {i,TTi) \ i G 1} as described above. We 
identify with {(i,TTi) \ i G /}. 

Note that if the orientation of the plane T is reversed, all the sequences are re- 
versed. The identification in the previous definition makes the notion of hyperline 
sequences independent of the chosen orientation of the plane T. 
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Remark. When we start with a set of vectors Vn and two admissible tangent 
planes T and T' , by radial projection we obtain two sets of signed planar points 
Pj and Pj . The reader can verify that our definition ensures that the result- 
ing hyperline sequences HS{P^) and HS{P^ ) will coincide. This allows for a 
definition of hyperline sequences associated to any of the previously considered 
geometric ordered sets: vectors, oriented central planes, etc. 



z 




Fig. 3. Arrangement C 5 of oriented great circles on the 2-sphere. 



Consider an arrangement C = {ci,...,c„} of n oriented great circles on 
the sphere S'^. To each circle Ci associate a hyper line sequence by recording the 
points of intersection (ordered according to the orientation of the circle Ci) with 
the remaining oriented circles. An index j is recorded as positive (resp. negative) 
when the circle Cj crosses Ci from left to right (resp., right to left). 

An arrangement of oriented lines Pj = {^i, . . . , Z„} induces a set of n hyper- 
line sequences HS(Lj^): for each line li, record the points of intersection with 
the other lines (ordered according to the orientation of the line). Each element 
j is signed: positive if line Ij crosses li from left to right, negative otherwise. 



Example 4. For the arrangement of oriented great circles in Fig. El we have 
the following induced set of normalized representations of hyperline sequences 
HS{C5). We get the same set of normalized representations HS{M) of hyperline 
sequences for M: 



HS{Cii) = HS{M) 
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4 Pseudoline Arrangements and Hyperline Sequences 

A pseudoline arrangement A in the projective plane is a set of simple closed 
curves such that every two curves have precisely one point in common, at which 
they cross each other. We exclude the case when all curves have a point in 
common. Let T be the group of homeomorphic transformations of the projective 
plane. For an arrangement A we have the equivalence class of arrangements 
cl(A) := {A'|A = tA,t G T}. We always consider pseudoline arrangements 
A as representatives of their equivalence class cl (A). An arrangement and its 
mirror image are identified. 

To introduce a suitable concept of orientation we single out a “line at infinity” 
of the projective plane. We embed the usual Euclidean plane in the projective 
plane with respect to this line at infinity. An oriented pseudoline different from 
the line at infinity is a simple closed curve which is parameterized and oriented 
such that the “left side” denotes the side which lies to its left when the parameter 
increases and the line at infinity has been removed. The projective plane without 
the line at infinity is the left side, or the right side, of the oriented line at infinity 
when its orientation is ccw, or cw, respectively. 

Definition 3. The oriented matroid given by an arrangement of n oriented 
pseudolines is the equivalence class with respect to homeomorphic transforma- 
tions of the projective plane of a finite ordered set of n simple oriented closed 
curves (oriented pseudolines) such that every two curves have precisely one point 
in common, at which they cross each other. We exclude the case when all curves 
have a point in common and identify an arrangement with its mirror image. 

The oriented pseudoline arrangement is called simple, uniform or in general 
position, if no more than two pseudolines cross at a point. 

The rule to create a set of hyperline sequences HS{Ln) from an arrangement 
of oriented lines = {l\, . . . ,ln\ can be carried over in the same way to any ar- 
rangement PLn = {ph, ■ . ■ ,pln} of oriented pseudolines. Since there are oriented 
pseudoline arrangements for which there is no oriented line arrangement within 
the class of homeomorphic transformations for n > 9, we get in the pseudoline 
case a strictly more general concept, |{iLS'(L„)|L„ is an arrangement of lines }| 
< \{HS{PLn)\PLn is an arrangement of pseudolines }| for n > 9. 

We extend the concept of oriented matroids induced by hyperline sequences 
in another way. Hyperline sequences of configurations and arrangements of the 
last section store the signs of determinants of 3 x 3 submatrices of the matrix M of 
a corresponding vector configuration = {ui, . . . , u„} C , n > 3 Vi ^ 0, i = 
1, . . . ,n. This is an invariant for all matrices M' G c/„(M). Let i,j,k be three 
distinct signed indices in A„. Let [i,j,k] be the determinant of the submatrix of M 
with row vectors Vi, Vj, Vk- If j and k appear within the same set q), of tt^, we have 
sign [i,j, fc] = 0. If j and k occur in this order in some half-period of tt^, we have 
sign [i,j,k] = -|-1, and sign [i,j, fc] = — 1 otherwise. The sign of the determinant 
x(ijfc) := sign [i,j, fc] is independent of the chosen half periods and compatible 
by alternation xiyk) = xiJki) = x{kij) = ~x{ikj) = ~x{kji) = -x(jzfc) and 
anti-symmetry x(zjfc) = —xi'^jk). 
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Given an abstract set of hyperline sequences, let us choose its corresponding 

3 

normalized form and define y : — >■ {— 1,0,+1}, (partially) by: •= Oj 

if j and k appear within the same set Qs of for i in En, j, k in En, j yf k, 
x{ijk) '■= +1, if j and k occur in this order in and xi^jk) '■= —1, if j and k 
occur in the reversed order in 

Extending this partial definition of x by alternation and anti-symmetry, the 
value of x(ijfc) for 0 < i < j < fc is obtained either directly, by the above rule 
applied to each of the three hyperline sequences, or via alternation and anti- 
symmetry. When these three values for x(*J^) are compatible in all cases, we 
say that the set of hyperline sequences admit an abstract sign of determinant 
function. 

Definition 4. A rank 3 oriented matroid with n elements given by hyperline 
sequences is a set of hyperline sequences {{i,TTi) \ i € 1} over En which admit 
an abstract sign of the determinant function. The oriented matroid is uniform 
when all hyperline sequences are uniform. 



Theorem 1. The hyperline sequences HS{PLn) of an oriented pseudoline ar- 
rangement PLn admit an abstract sign of the determinant function. 

For each rank 3 oriented matroid given by hyper line sequences HS we can 
find an oriented pseudoline arrangement PLn which induces it, HS = H S {P Ln) . 
For a direct proof of this theorem see m- 

While the pseudoline arrangement of an oriented matroid shows that we 
are dealing with a topological invariant, the hyperline sequences show that this 
information can be stored effectively and that the complete generation of these 
objects for given number of elements and given rank becomes possible. The 
enumeration of corresponding possible types of matrices is available only by 
extending this problem to the concept of oriented matroids and deciding later 
which of them are realizable. 

We again see that it is an essential problem to find out for a given oriented 
matroid whether it belongs to the non-realizable ones or to the realizable ones. 

5 Extension Determined by Mutations in Rank 3 

We consider the set = {1, 2, . . . , n}. We start with a uniform oriented matroid 
X = Xe„ in rank 3 with n elements and its set of mutations Mut^ = {(i,j, k) \ i < 
j < k}. The triple (z, k) is a mutation when changing its sign leads again to an 
oriented matroid. By deleting the n-th element we obtain the oriented matroid 

Theorem 2. The extension of an oriented matroid x = Xe^-i rank 3 with 
n — 1 elements by an additional element n is uniquely determined by the signs 
of brackets (x,y,n) £ Mut^ together with an additional sign of a bracket 
(x, y, n) ^ Mut. 
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Remarks, (i) This theorem was first proved by Bokowski and Scharnbacher 
m- Their proof is unpublished. It was originally conjectured by Richter-Gebert 
(private communication). We present a new proof here. 

(ii) Figure 2 shows that the additional assumption is essential in many instances. 




Fig. 4. Changing all mutations at the line at infinity simultaneously leads again to an 
oriented matroid. 



Proof. W.l.o.g. we assume that (l,n — l,n) is a mutation and that the sign 
of (l,n — 2,n) which is not a mutation is given. We look at the normalized 
representation of the hyperline sequence. We see immediately that the hyperline 
sequence of element 1 is uniquely defined. As a consequence, the sign of element 
n in the i-ih hyperline sequence (i > 1) is uniquely defined. 

Again as a consequence, the position of the element n is uniquely defined 
in all hyperlines which are contained in a mutation of n. We collect all these 
hyperline indices in a set E\ . 

Now we consider the oriented matroid XBiufn} obtained by deleting the el- 
ements not in El \ {n} from xe- The Folkman Lawrence representation of xe 
defines an additional cell decomposition within the Folkman Lawrence repre- 
sentation of X_Biu{rt}- On all rank 2 contractions of elements in Ei, we have a 
unique ordering of the cocircuits of xe- Now we look at a cell C within the Folk- 
man Lawrence representation of XEiu{n} which contains a pseudoline segment of 
pseudoline n. Because of the known ordering of cocircuits of xe on the boundary 
of C, we can split the set of elements not in Ei U {n} into those which cut the 
pseudoline segment of pseudoline n, say {ci,...,Cfe} and into those which do 
not cut, say {pi, . . . ,p/}. A latter parallel element pm decomposes C into two 
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discs. We denote with Cp^ the corresponding disc which contains the pseudoline 
segment of pseudoline n. Now consider C := int{C fl Cp^ fl Cp^ fl . . . fl Cp^). 
When C contains no cocircuits of xe-, we are done. In this case the insertion 
segments for the n-th element is uniquely defined. 

But if a cocircuit exists in C , there is one which is adjacent to the n-th pseu- 
doline segment. The two elements defining that cocircuit cut the n-th pseudoline 
segment within C and define together with n a triangle in C . If this triangle 
does not contain a cocircuit in its interior, we find a mutation in it which was 
excluded. But in the other case, we find a smaller triangle completely included 
in the former one, again having one cocircuit adjacent to the n-th element. After 
a finite number of steps we arrive at a contradiction. 

6 Dynamic Inductive Realization 

When we apply the dynamic inductive realization method, we start with a re- 
alized oriented matroid with fc — 1 elements. We consider having fc — 1 great 
circles on a 2-sphere. We add the fc-th pseudo great circle on the upper half 
of the 2-sphere as a rubber band (we identify antipodal points on the equator 
by a diameter which can rotate around the midpoint). We mark the mutations 
with pins which guarantee that the rubber band lies always on the proper side 
with respect to a mutation. When the rubber band can be straightened to a half 
circle not violating the mutation conditions, we have found according to our last 
theorem a realization of the oriented matroid with k elements. 

When this great half circle cannot be rectified, we can change a former el- 
ement within the realization space of the former oriented matroid such that 
inserting the new element becomes easier. Doing this systematically for all fore- 
going elements was very effective in many instances. 

This idea of the author was implemented in the rank 3 case by K.P. Pock 
in his Diplom thesis |IB|, support was given also by J. Richter-Gebert. The cor- 
responding idea worked also in the rank 4 case. But the corresponding theorem 
about the mutations does not hold. Results can be found in the articles P2| 
There is an implementation in the rank 4 case by J. Scharnbacher 123]. 

7 Cinderella Playing with Polytopes 

The first impression when checking what can be done with the software Cin- 
derella by J. Richter-Gebert and U. Kortenkamp ( 22 !, or with any other sim- 
ilar dynamic program, these are applications to planar drawings and planar 
theorems. But when the applications for architects come to your mind (see 
e.g. http : //juergen.bokowski . de and http : / /homepages . tu-darmstadt . de/ 
"^raiwi) projections from higher dimensions to the plane appear. Dynamic ge- 
ometry programs can be used to visualize, study and modify these projections 
and by this obtaining new insights on the situation in high dimensions. 

When we start drawing the projections of the unit vectors of a basis in i?", 
we can insert e.g. projections of all vertices of a polytope. The latter change 
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of the positions of all projections of the unit vectors of the basis generate all 
possible linear projections of the polytope onto the plane. The edge graph can 
easily be drawn. The face lattice of the poly tope can be visualized and studied 
this way, facets can appear as line segments. A first application can be seen in 

m 

For another interesting 3-sphere with 16 vertices , and 50 simplicial facets: 

1234 1235 1246 1256 1347 1357 1467 1567 2348 2389 

2395 2480 2406 2809 209a 20a6 29a5 2a56 3468 3467 

368c 36c7 38c9 3c95 3c57 46d8 4bd7 4d80 4d06 4d67 

bdSc bdcl d0e6 dcel de67 0e9a 0eo6 ce95 ce57 c9a5 

eo56 e567 fdSO fdSc fdOe fdce /809 /8c9 /0e9 feed 

B. Sturmfels and A. Zelevinsky asked whether it forms the face lattice of a 4- 
polytope. We use this 3-sphere here as an example to show another method which 
was not applied in this context before. A realization was found by Bokowski 
by drawing the edge graph with its symmetry. Of course some intuition was 
essential to find the realization. But in order to confirm the lattice structure of 
the boundary of the polytope you can look at the Cinderella program output at 
http : //juergen .bokowski . de 

With the generators of the symmetry group 

(2) (5)(6)(8)(c)(d)(l,a)(6,/)(3,9)(4,0)(7,c) 

(3) (4)(7)(9)(0)(c)(l,6)(a,/)(5,9c(2,8)(6,d) 

(l)(a)(6)(/)(2, 6, 5)(9, 0, e)(8, d, c)(4, 7, 3) 




Fig. 5. A projection of the corresponding 4-polytope. 



Effective Methods in Computational Synthetic Geometry 191 



we can determine the orbits of the facets. By checking one facet in each orbit and 
by using the connectedness of the facets, we can confirm that the face lattice is 
that of a convex polytope. This is done for each facet by changing the projection 
contineously until a position is reached in which the vertices of that facet of the 
sphere lies on a supporting line of the projection of the polytope. 
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Abstract. Geometric straight-line programs m can be used to model 
geometric constructions and their implicit ambiguities. In this paper we 
discuss the complexity of deciding whether two instances of the same 
geometric straight-line program are connected by a continuous path, the 
Complex Reachability Problem. 



1 Introduction 

Straight-line programs and randomized techniques for proving their equivalence 
did find their application in geometric theorem proving. Using estimates for 
the degrees of the variables of a multivariate polynomial given by a straight- 
line program and evaluations for some random samples, we can prove geometric 
theorems with much less computational effort than usual prij . for example 
compared to symbolic methods using Grobner bases. 

An apparent drawback of polynomials is that we have to refer to systems of 
polynomial equations as soon as we want to describe theorems involving circles 
or conics. Although there are very powerful methods to do theorem proving in 
these contexts (e.g. Wu’s method, see nsna), it is desirable to have a concept 
like straight-line programs that is able to describe constructive theorems, and 
is able to model the dynamic aspects of theorems as they occur in dynamic 
geometry systems. The implementation of one dynamic geometry system m 
caused the definition of geometric straight-line programs, which are one way to 
approach the above issues. 

One question that must be settled before we could use techniques similar to 
the methods of Schwartz and Zippel i™ to prove geometric theorems is the 
question of (complex) reachability: Gan we move one instance of a geometric 
theorem continuously into another instance? This paper describes first results 
on the algorithmic complexity of this question. 

2 Geometric Straight-Line Programs 

Geometric straight-line programs extend the concept of straight-line programs 
(see the book of Biirgisser et al. PJ for a detailed discussion of straight-line pro- 
grams). Informally, a straight-line program (SLP) is a sequence of operations 
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(usually addition, multiplication, subtraction, and sometimes division) that op- 
erate on a certain input (usually values of some algebra A) or intermediate results 
from previous operations. 

Straight-line programs are important due to the fact that they provide a 
very compact description of multivariate polynomials (or rational functions, if 
we allow divisions) . The degree of the polynomials can be much higher than the 
length of the straight-line program (up to exponential). 

In it is shown that geometric constructions using points and lines as ob- 
jects, and meets and joins as operations, are equivalent to straight-line programs 
over IR or C. In a way this is a consequence of von-Staudt’s approach, who has 
shown that there is a coordinate-free description of projective geometry Pj. 

As soon as we want to describe constructions that involve ambiguous op- 
erations (like Intersection of Circle and Line, Intersection of Circle and Circle, or 
Angular Bisector of two lines) the concept of straight-line programs fails. Better 
said, it is not possible to describe constructions with varying input parameters 
that behave continuously using straight-line programs. 

Geometric straight-line programs (GSPs) are a way to keep a concise alge- 
braic description even for constructions involving ambiguous operations. The 
operations of a straight-line program are replaced by relations from a suitable 
relational instruction set (RIS). The objects can be choosen arbitrarily, as long 
as they match the relations. In this paper we will deal with the complex numbers 
C as objects and the RIS R := {-L, — , ±\A} only, and we will emphasize this 

sometimes by calling them eomplex GSPs. 

Again, we refer to pj for a more formal and detailed description. Here we rely 
on the readers‘ intuition and introduce geometric straight-line programs using 
an example. 



Example 1 (A GSP on (C, i?)/ Here is a GSP encoding the expression 
-|- with two input variables. The negative indices denote input vari- 
ables, the other ones index the intermediate results. All statements refer to the 
indices of previous results or input variables. 



Index 

-2 

-1 

0 

1 

2 
3 



Statement 

Z2 

Zl 

*(- 1 ,- 1 ) 
*(- 2 ,- 2 ) 
+ ( 0 , 1 ) 
±y^(2) 



Remark 

Input 

Input 










A fundamental difference between ordinary straight-line programs and GSPs 
is that we cannot just “run through” the statements of a GSP in order to cal- 
culate the expression for a given input. This is due to the fact that the relations 
can have different valid outputs for the same input. This gives rise to the no- 
tion of an instance of a GSP, an assignment of the input parameters and all 
intermediate results that is compatible with the relations. 



Example 2 (Instance of a GSP). An instance for the GSP above is given by 
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Index 


[Value! 


Remark 


-2 


3 


Input Z 2 


-I 


4 


Input zi 


0 


16 




I 


9 


Z2^ 


2 


25 


+ 22^ 


3 


1 -5 I 


±V-2l^ + Z2 



Observe that all but the last value are determined by the input, and there is 
only one other instance with the same input (where the last value is 5). 

Moving GSPs 

For polynomials, or straight-line programs, it is easy to speak about dynamic 
changes of the input parameters. Since the value of all intermediate results of 
an SLP is determined by the input, we can vary the input parameters and 
recalculate the polynomial. Of course, the intermediate results are polynomials 
in the input variables, and as such they are analytic functions, in particular 
continuous. 

If we want to do the same with GSPs we must specify how to resolve am- 
biguities. A natural requirement would be that the intermediate results should 
be continuous functions in the input parameters. A direct consequence is that 
the intermediate results must be analytic ^ in the following way: Let U := 
(ui, . . . , Un), V := (vi, . . . , Vn) C G" be two inputs for a complex GSP, and let 
7: [0, 1] I— >■ C” be a path from 7(0) = U to 7(1) = V. If we can find instances of 
the GSP for every A S [0,1] such that every intermediate result is an analytic 
function in A for A S (0, 1) and a continuous function for A G [0, 1], then these 
instances form an analytic path. 

Here are two examples showing the subtilities of analytic paths: 

Example 3 (Square Root). Take the complex GSP with one input that has the 
±\A-Relation as the one and only statement, and consider the path 

7: [0,1]^C 

7(A) = 

For each of the two possible choices at A = 0 there is a unique assignment of 
instances for A G (0, 1] to form an analytic path, which is the proper branch of 
the complex square root function. The value of the square root at A = 1 will be 
the negative of the value at A = 0. 

We can find this path by doing analytic continuations along 7, and here in 
this example it is clear that we can do this for all paths avoiding 0 for A G (0, 1), 
and only these. 

Example 4 (Roots of squares). Take the complex GSP with one input z and 
with two statements, first multiplying the input with itself and then the 
Relation. The first intermediate result, the square of the input, is determined 
by the input, and since it is a polynomial, it is analytic in the input z, so it is 
analytic for any analytic function 7. 
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The second relation can be simplified to either +z or —z, but not to the 
absolute value function |a:|, since this would destroy analyticity. We do not have 
to consider a special path to observe this, it holds for any path. 

In the second example there is not always a need to avoid the 0 for the square 
root function, for example for the path 7 (A) = 2A — 1 there are instances that 
make it analytic. However, in most considerations it will be a good idea to avoid 
any zeros of square roots, since these are the critical points where singularities 
can occur. 

3 Complex Reachability and Testing of Polynomials 

A problem in straight-line program analysis is to decide whether a given straight- 
line program is equivalent to another one, i.e. whether it describes the same 
polynomial (or rational function). The algorithmic complexity of this decision 
problem is unknown, but there exist polynomial-time randomized algorithms 
PI . The main obstacle is that we can neither handle the full, symbolic expression 
for the polynomial, since the coefficients and the degree of the polynomial can 
be large, nor the evaluation of the straight-line program for sufficiently large 
numbers, since the coding length for the intermediate results becomes too large. 

If we could find an algorithm to test equivalence of straight-line programs 
efficiently, then their range of application could be extended to efficient encod- 
ings of large numbers. It would also be possible to derive efficient deterministic 
algorithms to prove geometric theorems. 

We will now formulate a version of this decision problem which is equivalent 
to the equivalence testing problem. 

[SLP zero testing] Given a division-free straight-line program F over Q 
with one input variable. Is the polynomial p encoded by T the zero 
polynomial? 

We will show that this problem is at most as hard as deciding whether we can 
move analytically from one instance of a GSP to another instance of the same 
GSP that is different at exactly one intermediate result by giving a polynomial 
transformation from [SLP zero testing] to the following decision problem: 
[Complex Reachability Problem] Given two instances of a complex GSP 
with one input variable that differ in exactly one intermediate result. Is 
it possible to move analytically from the first instance to the second? 

We will prove the following theorem, with this corollary as an easy conse- 
quence: 

Corollary 1. The [Complex Reachability Problem] is algorithmically at least as 
hard as [SLP zero testing]. 

Theorem 1. There is a polynomial transformation of [SLP zero testing] to the 
[Complex Reachability Problem], i.e. we can answer an instance of [SLP zero 
testing] by transforming it to an instance of the [Complex Reachability Problem] 
and answering this. 
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Proof. First, we have to clarify how we specify an instance of a complex GSP in 
a polynomial size of the encoding length of the GSP (where the encoding length 
of the GSP is the number of bits needed to write down all statements of the 
GSP). We will deal with GSP inputs that have polynomial encoding length, and 
then we just have to specify for each ±-yt_statement which solution we choose. 
This can be done using one bit for each decision, saying to choose the solution 
with the smaller or equal angle in the polar coordinate representation of the 
two possibilities. We denote an instance by writing down all values of the input 
variables and a + or — for each decision bit. 

Observe that there is no need to evaluate the GSP; indeed, we must not 
evaluate the GSP since this could take exponential time. 

Having done this, we assume to have an SLP P of length n with one input 
variable 2 : and want to know whether it describes the zero polynomial. Let us 
refer to the last result, the polynomial, by p{z). 

Let M be the largest constant that can be created using a straight-line pro- 
gram Pm of length n and with encoding length less or equal to the encoding 
length of P. Using one additional statement we can write a straight-line pro- 
gram P 2 M that evaluates to 2M. Thus we can transform P in polynomial time 
and space to P' which evaluates p{z) + 2M. Due to the construction of P' the 
value at z = 0 of T' cannot be 0. 

Now we add one additional statement to P' in order to evaluate ±i/p(z) -I- 2M. 
Let the new GSP be Ti. In a similar way, we also create a GSP P 2 that evaluates 

zp{z) + 2M. This requires just one additional statement compared to P\ 

These two GSPs can be used to decide the zeroness of p{z) using the complex 
reachability decision. Let (z = 0, -k) be the start instance, and (z = 0, — ) the 
end instance for both Pi and P 2 - 

Now we claim that the reachability decision will be “not reachable” for both 
Pi and P 2 if and only if p(z) is the zero polynomial. For the direction we 
observe that p(z) + 2M = 2M and zp{z) + 2M = 2M, i.e. the arguments of the 
i-yd'-statement are constant and non-zero. So they can never change continuously 
from one sign decision to the other. 

For the =k direction we note that p{z) + 2M and zp{z) + 2M are two poly- 
nomials of even and odd resp. odd and even degree if p{z) ^ 0. This means that 
at least one of them has a root of odd multiplicity at, say, Zg- But this means 
that we can change the sign of the square root by following a path from z = 0 to 
z = zq -k e, cycling once around zg and going back from z = zg -k e to z = 0. So 
for at least one of P\ and P 2 the reachability decision will be “reachable.” □ 

4 Remarks 

The paper “Randomized Zero Testing of Radical Expressions and Elementary 
Geometry Theorem Proving” by Daniela Tulone, Ghee Yap and Ghen Li, that 
was also presented at ADG 2000, also introduces square roots for straight-line 
programs. The main difference between our two approaches is that we rely on the 
implicit sign decision for our notion of geometric theorems, which is different from 
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the usual notion of theorems given by polynomial equations for hypothesis, non- 
degeneracies and conclusions. Also, since we only work with complex numbers, 
we cannot state theorems that are given by semi-algebraic varieties. 

Nevertheless, it seems that both the results of both papers can be combined 
in one or the other way, which we will try to do in our further investigations. 

Kurt Mehlhorn pointed out that our transformation shows not only that the 
complex reachability problem is as hard as to find out whether a polynomial is 
the zero polynomial, but also as hard as to find out whether a polynomial has 
at least one root of odd degree. 
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Abstract. In this paper we propose a bracket algebra based elimina- 
tion method for automated generation of readable proofs for theorems 
in incidence geometry. This method features three techniques, the first 
being heuristic automated reordering of geometric constructions for the 
purpose of producing shorter proofs, the second being some heuristic 
elimination rules which improve the performance of the area method of 
Zhang and others without introducing signed length ratios, the third be- 
ing a simplification technique called contraction, which reduces the size 
of bracket polynomials. More than twenty theorems in incidence geome- 
try have been proved, for which short proofs can be produced very fast, 
together with the corresponding nondegeneracy conditions. An interest- 
ing phenomenon is that a proof composed of polynomials of at most two 
terms can always be found for any of these theorems, similar to that by 
the biquadratic final polynomial method of Richter-Gebert. 



1 Introduction 

According to the first fundamental theorem of invariant theory jl Yj , brackets are 
the fundamental invariants under projective transformations. From an invariant 
theoretic point of view, the ring of brackets forms a suitable algebraic setting to 
deal with projective configurations unna. Bracket algebra is the most general 
structure in which projective properties can be expressed in a coordinate-free 
way. 

Let be an (n -|- l)-dimensional vector space. For a sequence of n -I- 1 

vectors Ai, . . . , A„ 4 _i G the corresponding bracket is defined by 



[Ai • • • A„_|_i] — det(Ai • • • A„+i). (1) 

Let Ai, . . . , Am be indeterminates (vectors) in m > n. The bracket algebra 

generated by them is the polynomial algebra 'R,{[Ai^ ■ ■ ■ < ij < m) gen- 

erated by all possible brackets of the indeterminates modulo the ideal generated 
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by the following Grassmann-Pliicker polynomials: 

( n+2 

= E (-l)'=[An • • • A,„ ■ • • A,,_, A,,^, • • • A 

Lfe=i 



■Jn+2j 



1 <ii < ■ ■ ■ < in < m, 1 < ji < • • • < jn+2 <m > . 



( 2 ) 



On the level of bracket algebra, a geometric theorem prover can be imple- 
mented using the straightening algorithm E2EI The main idea behind this 
approach is to rewrite the projective incidence statement as a term in Grass- 
mann algebra which vanishes if and only if the statement is true. After this, the 
Grassmann algebra term is expanded into a bracket algebra term. If this term 
vanishes modulo the ideal generated by the Grassmann-Pliicker polynomials, 
then the theorem is proved. It is shown by Sturmfels and White m that the 
straightening algorithm can be considered as a special kind of Grobner bases 
algorithm for bracket polynomials. The algorithm works in full generality, but 
requires over-exponential GPU time. 

The prover proposed by Richter-Gebert HS| is based on the biquadratic fi- 
nal polynomial method (see also mm)- A proof produced by this prover is 
extremely short and geometrically meaningful. In particular, every polynomial 
which occurs in the proof is composed of two terms. Although the algorithm does 
not work in general, it could manage almost all projective incidence theorems. 

Another prover is proposed by Ghou and others P| and is based on the area 
method. This is an elimination method whose rules are derived from properties 
of signed areas, or brackets in 2-dimensional projective space. This method is 
complete when area coordinates are used. When the coordinates are avoided, 
proofs produced by the prover are often short and readable. 

Our work is inspired both by the area method and by the final polynomial 
method. First, we propose a set of heuristic elimination rules to improve the 
performance of the area method by producing shorter proofs (for elimination 
methods, see also UMIHIZI). Second, we propose a new technique for bracket 
polynomial simplification, of which a special case is used as the foundation for 
setting up biquadratic equations in the final polynomial method. Third, as an 
optional technique, we use the heuristic method in for automated reordering 
of geometric constructions in case a polynomial in the proof has more than two 
terms. We build up a prover based on the three techniques. 

The performance of the prover is very satisfactory: more than twenty inci- 
dence theorems have been tested, which covers all the 2-dimensional incidence 
theorems in PCS]. For every theorem, a proof composed of polynomials of at 
most two terms can be produced very fast. Furthermore, every proof finishes 
before any free point in the plane is eliminated, and in some cases, even before 
some semifree points on lines are eliminated. 

The prover is complete for 2-dimensional incidence theorems of the following 
constructive types. 

Constructive type 1. Take a free point in the plane. 



Constructive type 2. Take a semifree point on a line. 



Constructive type 3. Take the intersection of two lines. 
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2 Algorithm 



The following is an algorithm which can produce a proof in the form of brackets 
for a theorem whose conclusion is either an equality or an inequality of negative 
type. 



Input: 

— A set of constructions of points. 

— An order for eliminations of points. 

— A conclusion cone = 0 or cone ^ 0, where cone is a polynomial of 
brackets. 

Preprocess. Change every collinearity constraint in the constructions into a 
rule for computing brackets. (Optional) reorder the points together with 
their geometric constructions. 

Step 1. Eliminate constrained points and semifree points. First, as- 
sume that point X is the intersection of lines AB and CD. To eliminate X 
from a bracket [XPQ], there are three formulas available: 

[XPQ] = X V (P A Q) = (A A B) V (C A D) V (P A Q) 



r [ABD][CPQ] - [ABC][DPQ] 
= \ [ACD][BPQ] - [BCD][APQ] 
[ [ABP][CDQ] - [ABQ][CDP] 



01 ) 

02) (3) 

03) 



Rule 1 (same as in the area method). If a bracket in (0 equals zero, 
use the corresponding formula. 

Rule 2 (heuristic). In general, use the formula which separates into dif- 
ferent brackets the pair of points in (A, B), (C, D), (P, Q) having the 
largest number of concurrent lines. 



In the area method, 02) is generally adopted. 

Second, assume that point X is on line AB. Let A,B,C be linearly inde- 
pendent vectors. To eliminate X from a bracket polynomial p, first contract 
p (see Step 2), then for each [XPQ] in p, apply the following formula: 



[ABC] [XPQ] = [XBC][APQ] - [XAC][BPQ], (4) 



which is a Grassmann-Pliicker relation in the case [XAB] = 0. 

Rule 3 (heuristic). In general, choose C to be the free point outside line 
AB that has the largest number of occurrences in p. The nondegeneracy 
condition is [ABC] 0 if [ABC] occurs in the denominator of p. 

Step 2. Simplification by contraction. For any vectors Ai, . . . , A 5 in 7^^, 



[AiA 2A5][A3A4A5] -t- [AiAsA 5][A4A2A5] = [Ai A4 A5] [A3 A2 A5] . 



This is a Grassmann-Pliicker relation. 
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Let p be a bracket polynomial of two terms. If p is reduced to a single 
monomial by the above identity, this reduction is called a contraction. It can 
be extended to any bracket polynomial. 

It can be proved that a polynomial is reduced to zero modulo the ideal gen- 
erated by the Grassmann-Pliicker polynomials if and only if when multiplied 
by some bracket polynomial, it is reduced to zero through contractions. As 
a result, the outcome of the contraction is always zero for cone = 0, and 
nonzero for cone ^ 0. 

Output: The proving procedure and the nondegeneracy conditions. 

Remark 1. In earlier papers [imni, we addressed the problem of theorem prov- 
ing in projective geometry involving both lines and conics. The current algorithm 
has the function of reordering geometric constructions, i.e., reformulation of ge- 
ometric theorems. At this moment it works only for incidence geometry. 

Remark 2. The heuristic rules 2 and 3 can contribute to obtaining short proofs. 
When searching for a proof composed of polynomials of at most two terms, these 
rules can serve as guidelines for setting up precedence tables. 

Remark 3. The method is between the biquadratic final polynomial method P 
and the reduction method in classical invariant theory P|. 

Remark 4. To improve the performance of the algorithm for cone = 0, after 
each elimination we can delete the common bracket factors in cone. These factors 
are not nondegeneracy conditions. 



3 Examples 

Below is a collection of 23 examples and their machine generated proofs com- 
posed of polynomials of at most two terms. The program is written in Maple 
V.4 and runs on an IBM compatible Pentium 11/366 with Microsoft Windows 
98. The generation of each proof is very fast. The nondegeneracy conditions are 
generated at the same time. 

For theorems of equality type, common bracket factors (underlined) are found 
out in each step and are deleted before the next step starts. 

Example 1 (See also |3|, Example 6.203). 

Free points: 1, 2, 3, 4. 

Intersections: 

5 = 12 n 34, 6 = 13 n 24, 7 = 23 n 14, 

8 = 23 0 56, 9 = 24 0 57, 0 = 34 0 67. 



Conclusion: 8, 9, 0 are collinear. 
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Proof: 



Rules [ 890 ] 



0 

= [ 347 ] [ 689 ] -[ 346 ] [ 789 ] 



[ 689 ] = [ 248 ] [ 567 ] 
[ 789 ] = [ 247 ] [ 578 ] 

[ 248 ] = [ 236 ] [ 245 ] 
[ 578 ] = [ 235 ] [ 567 ] 

[ 347 ] = [ 134 ] [ 234 ] 
[ 247 ] = [ 124 ] [ 234 ] 

[ 236 ] = [ 123 ] [ 234 ] 
[ 346 ] = [ 134 ] [ 234 ] 

[ 245 ] = -[ 124 ] [ 234 ] 
[ 235 ] = -[ 123 ] [ 234 ] 



9 

= [ 248 ] [ 347 ] [ 567 ] - [ 247 ] [ 346 ] [ 578 ] 

8 

= [ 567 ] [ 236 ] [ 245 ] [ 347 ] - [ 567 ] [ 235 ] [ 247 ] [ 346 ] 
7 

= [ 234 ] [ 134 ] [ 245 ] [ 236 ] - [ 234 ] [ 124 ] [ 235 ] [ 346 ] 
6 

= [ 134 ] [ 234 ] ( [ 123 ] [ 245 ] - [ 124 ] [ 235 ] ) 

5 

= 0 . 



Nondegeneracy condition: none. 

Example 2 (See also jSI, Proposition 5.8). 

Free points: 1, 2, 3, 4. 

Intersections: 



5 = 12 n 34, 6 = 13 n 24, 7 = 23 n 14, 
8 = 13 n 57, 9 = 67 n 48, 0 = 24 n 57. 



Conclusion: 3, 9, 0 are collinear. 
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Proof: 

Rules [ 390 ] 

0 

= -[ 257 ] [ 349 ] -[ 239 ] [ 457 ] 

9 

= [ 257 ] [ 348 ] [ 467 ] - [ 230 ] [ 457 ] [ 478 ] 

8 

= - [ 134 ] [ 257 ] [ 357 ] [ 467 ] + [ 137 ] [ 230 ] [ 457 ]^ 

7 

= [ 134 ] [ 234 ] (- [ 124 ] [ 134 ] [ 146 ] [ 235 ]^ - [ 123 ] [ 145 ]^ [ 234 ] [ 236 ]) 

= [ 124 ]^ [ 134 ]^ [ 235 ]^ - [ 123 ]^ [ 145 ]^ [ 234 ]^ 

5 

= 0 . 



[ 349 ] = -[ 348 ] [ 467 ] 
[ 239 ] = [ 236 ] [ 478 ] 

[ 348 ] = -[ 134 ] [ 357 ] 
[ 478 ] = -[ 137 ] [ 457 ] 

[ 257 ] = [ 124 ] [ 235 ] 

[ 467 ] = [ 146 ] [ 234 ] 

[ 357 ] = [ 134 ] [ 235 ] 

[ 457 ] = [ 145 ] [ 234 ] 

[ 137 ] = -[ 123 ] [ 134 ] 

[ 146 ] = -[ 124 ] [ 134 ] 
[ 236 ] = [ 123 ] [ 234 ] 

[ 235 ] = -[ 123 ] [ 234 ] 
[ 145 ] = -[ 124 ] [ 134 ] 



Nondegeneracy condition: none. 

Example 3 (See also p. 63). 

Free points: 1, 2, 3, 4. 

Intersections: 

5 = 12 n 34, 6 = 13 n 24, 7 = 23 n 14, 
8 = 13 n 57, 9 = 14 n 56, 0 = 34 n 67. 



Conclusion: 8, 9, 0 are collinear. 
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Proof: 



Rules [ 890 ] 



0 

= [ 347 ] [ 689 ] -[ 346 ] [ 789 ] 



[ 689 ] = [ 156 ] [ 468 ] 

[ 789 ] = - [ 148 ] [ 567 ] 

[ 468 ] = - [ 134 ] [ 567 ] 
[ 148 ] = - [ 134 ] [ 157 ] 

[ 347 ] = [ 134 ] [ 234 ] 
[ 157 ] = [ 123 ] [ 145 ] 

[ 156 ] = -] 124 ]] 135 ] 
[ 346 ] = [ 134 ] [ 234 ] 

[ 135 ] = -] 123 ]] 134 ] 
[ 145 ] = - [ 124 ] [ 134 ] 



9 

= [ 156 ] [ 347 ] [ 468 ] + [ 148 ] [ 346 ] [ 567 ] 

8 

= [ 134 ] [ 567 ] (- [ 156 ] [ 347 ] - [ 157 ] [ 346 ] ) 

7 

= - [ 134 ] [ 156 ] [ 234 ] - [ 123 ] [ 145 ] [ 346 ] 

6 

= [ 134 ] [ 234 ] ([ 124 ] [ 135 ] - [ 123 ] [ 145 ]) 

5 

= 0 . 



Nondegeneracy condition: none. 

Example 4 (See also 0, Example 6.32). 

Free points: 1, 2, 3, 4, 5. 

Intersections: 

6 = 12 n 34, 7 = 13 n 24, 8 = 23 n 14, 9 = 56 n 78, 
0 = 57 n 68, A = 39 n 20, B == 67 n 58. 



Conclusion: 1, A, B are collinear. 
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Proof: 



Rules [lAB] 



[17A] = -[179] [230] 
[16A] = -[160] [239] 

[230] = [236] [578] 
[160] = [168] [567] 

[179] = -[178] [567] 
[239] = [237] [568] 

[178] = [123] [147] 
[168] = [123] [146] 

[147] = -[124] [134] 
[237] = [123] [234] 

[236] = -[123] [234] 
[146] = -[124] [134] 



= [17A][568]-[16A][578] 

A 

= - [179] [230] [568] + [160] [239] [578] 

0 

= - [578] [179] [236] [568] + [578] [168] [239] [567] 
9 

= [567] [568] ([178] [236] + [168] [237]) 

8 

= [123] [147] [236] + [123] [146] [237] 

7 

= - [124] [134] [236] + [123] [146] [234] 

6 

= 0 . 



Nondegeneracy condition: none. 



Example 5 (Pappus point theorem, see also |^, Example 6.22). 

Free points: 1, 2, 3, 4, 5. 

Intersections: 

6 = 13 n 24, 7 = 23 n 56, 8 = 25 n 34, 9 = 12 n 68, 
0 = 79 n 24, A = 39 n 67, B = 15 n 4A, C = 28 n 39. 

Conclusion: 0, B, C are collinear. 
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Fig. 5. Example 5. 



Proof: 



Rules [OBC] 



[39B] = -[ISA] [349] 
[28B] = [128] [45A] 

[15 A] = [167] [359] 
[45 A] = [359] [467] 

[280] = [248] [279] 
[390] = [249] [379] 

[279] = -[127] [268] 
[349] = [128] [346] 

[379] = -[137] [268] 
[249] = -[124] [268] 

[248] = [234] [245] 

[268] = -[234] [256] 

[127] = [123] [256] 

[167] = -[156] [236] 
[467] = -[236] [456] 
[137] = [123] [356] 

[156] = -[124] [135] 
[346] = [134] [234] 

[456] = [134] [245] 

[356] = [135] [234] 



C 

= [280][39B]-[28B][390] 

B 

= - [15A] [280] [349]- [128] [390] [45 A] 

A 

= - [359] [167] [280] [349] - [359] [128] [390] [467] 
0 

= - [167] [248] [279] [349]- [128] [249] [379] [467] 



9 

= [128] [268] ( [127] [167] [248] [346] - [124] [137] [268] [467] ) 



= [234] [127] [167] [245] [346] + [234] [124] [137] [256] [467] 



7 

= [123] [236] [256] (- [156] [245] [346] - [124] [356] [456]) 



6 

= 0 . 



Nondegeneracy condition: none. 
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Example 6 (Pappus’ theorem, see also |2|, Example 6.20). 

Free points: 1,2, 3, 4. 

Semifree points: 5 on 12, 6 on 34. 

Intersections: 7 = 23 fi 14, 8 = 35 fl 16, 9 = 45 fl 26. 
Conclusion: 7, 8, 9 are collinear. 




Fig. 6. Example 6. 



Proof: 



Rules 



[ 278 ] = [ 136 ] [ 257 ] 

[ 678 ] = [ 167 ] [ 356 ] 

[ 257 ] = [ 124 ] [ 235 ] 

[ 167 ] = [ 123 ] [ 146 ] 

[ 134 ] [ 456 ] = -[ 146 ] [ 345 ] 
[ 134 ] [ 356 ] = -[ 136 ] [ 345 ] 



8 



7 



6 



[ 789 ] 

[ 278 ] [ 456 ] -[ 245 ] [ 678 ] 

[ 136 ] [ 257 ] [ 456 ] - [ 167 ] [ 245 ] [ 356 ] 

[ 124 ] [ 136 ] [ 235 ] [ 456 ] - [ 123 ] [ 146 ] [ 245 ] [ 356 ] 
[ 130 ] [ 146 ] [ 345 ] 

(-[ 124 ][ 235 ] + ] 123 ][ 245 ]) 

[ 134 ] 



= 0 . 



Nondegeneracy condition: [134] yf 0. 




Fig. 7. Example 7. 
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Example 7 (Desargues’ theorem, see also 0, Example 6.24). 

Free points: 1,2, 3, 4, 5. 

Semifree point: 6 on 13. 

Intersections: 7 = 12 fi 45, 8 = 15 iT 24, 9 38 fl 56, 0 = 23 iT 49. 

Conclusion: 6, 7, 0 are collinear. 

Proof: 



Rules [ 670 ] 



[ 239 ] = -[ 238 ] [ 350 ] 
[ 679 ] = [ 368 ] [ 507 ] 

[ 238 ] = -[ 125 ] [ 234 ] 
[ 368 ] = [ 124 ] [ 356 ] 

[ 467 ] = [ 124 ] [ 450 ] 
[ 567 ] = [ 125 ] [ 450 ] 



0 

= [ 239 ] [ 467 ] -[ 234 ] [ 679 ] 

9 

= - [ 238 ] [ 356 ] [ 467 ] - [ 234 ] [ 368 ] [ 567 ] 



8 

= [ 234 ] [ 356 ] ( [ 125 ] [ 407 ] - [ 124 ] [ 567 ] ) 



7 

= 0 . 



Nondegeneracy condition: none. 

Example 8 (See also |3|, Example 6.34). 

Free points: 1, 2, 3. 

Semifree points: 4 on 12, 5 on 12, 6 on 13, 7 on 23. 
Intersections: 



8 ==23 0 46, 9 = 23 0 56, 0 = 13 0 57, 
A = 13 0 47, B = 12 0 80. 

Conclusion: 9, A, B are collinear. 




Fig. 8. Example 8. 
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Proof: 



Rules [9AB] 



B 



[89 A] = [137] [489] 

[90 A] = [139] [470] 

[120] = [123] [157] 

[470] = -[137] [457] 

[489] = -[236] [458] 

[139] = [123] [350] 

[458] = -[234] [450] 

[128] = [123] [240] 

[123] [157] = -[127] [135] 
[123] [457] = [127] [345] 

[123] [456] = [120] [345] 

[123] [350] = [135] [230] 

[123] [240] = -[126] [234] 



A 



0 



8 



7 



0 



[120] [89A] + [128] [90A] 

[120] [137] [489] + [128] [139] [470] 

[137] [123] [157] [489]- [137] [128] [139] [457] 

- [123] [157] [236] [458]- [123] [128] [356] [457] 

[157] [234] [236] [450] - [123] [246] [356] [457] 

[127] 

(- [135] [234] [236] [450]- [123] [246] [345] [356]) 

[123] 

0 . 



Nondegeneracy condition: [123] ^ 0. 

Example 9 (See also |3, Example 6.38). 

Free points: 1, 2, 3, 4. 

Semifree point: 5 on 12. 

Intersections: 

6 = 12 0 34, 7=13 0 24, 8 = 23 0 14, 9 = 13 0 45, 0 = 23 0 45, 

A = 14 O 35, B = 24 O 35, C = 12 O 89, D = 12 O 70, E = 12 O OA. 

Conclusions: (1) 7, A, C are collinear; (2) 8, B, D are collinear; (3) 9, B, 
E are collinear. 

Proof: (1) 



Rules [7AC] 



= [189][27A]-[17A][289] 



[27A] = [127] [345] 

[17A] = -[135] [147] 

[189] = -[138] [145] 
[289] = -[128] [345] 

[138] = -[123] [134] 
[128] = -[123][124] 



= [127] [189] [345] + [135] [147] [289] 

9 

= - [345] [127] [138] [145] - [345] [128] [135] [147] 



= [123] [127] [134] [145] + [123] [124] [135] [147] 
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[127] = [123] [124] 

[147] = -[124] [134] 



7 



[124] [134] ( [123] [145] - [124] [135] ) 



= 0 . 



Rules [8BD] 



[28B] = -[235] [248] 
[18B] = - [128] [345] 

[170] = [127] [345] 

[270] = -[237] [245] 

[248] = -[124] [234] 
[128] = -[123][124] 

[127] = -[123][124] 
[237] = [123] [234] 



= [170][28B]-[18B][270] 

B 

= - [170] [235] [248] + [128] [270] [345] 

0 

= - [345] [127] [235] [248] - [345] [128] [237] [245] 
8 

= [124] [127] [234] [235] + [124] [123] [237] [245] 

7 

= [123] [234] (- [124] [235] + [123] [245] ) 



= 0 . 



Rules [9BE] 

E 

= [10A][29B]-[19B][20A] 

B 

= - [10 A] [235] [249] + [159] [234] [20 A] 



[29B] = -[235] [249] 
[19B] = -[159] [234] 
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= [135] [140] [235] [249] - [134] [159] [234] [250] 

0 

= [234] [235] (- [135] [145] ]249] + [134] [159] [245]) 
9 

= 0 . 

Nondegeneracy condition: none. 

Example 10 (See also 0, Example 6.208). 

Free points: 1, 2, 3, 4. 

Semifree point: 5 on 12. 

Intersections: 

6 = 12 n 34, 7 = 13 n 24, 8 = 13 n 45, 9 = 23 n 67, 

0 = 24 n 19, A = 34 n 19, B = 23 n 80 , C = 49 n 30. 

Conclusions: (1) 5, A, B are collinear; (2) 7, A, C are collinear. 



[lOA] = - [135] [140] 
[20A] = - [134] [250] 

[140] = -[145] [234] 
[250] = -[235] [245] 

[249] = -[134] [245] 
[159] = -[145] [135] 




Proof: (1) 



Rules [5AB] 

B 

= [280] [35A]- [25 A] [380] 

A 

= [139] [280] [345] + [134] [259] [380] 

0 

= [129] [139] [248] [345] - [124] [134] [259] [389] 



[35 A] = [139] [345] 

[25A] = -[134] [259] 

[280] = [129] [248] 

[380] = -[124] [389] 
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[129] = [123] [267] 

[139] = [123] [367] 9 ^ 

[389] = -[238] [367] 

[259] = -[235] [267] 

[248] = -[134] [245] _8 

[238] = -[123] [345] 



[267] [367] ([123]^ [248] [345]- [124] [134] [235] [238]) 



[123] [134] [345] (- [123] [245] + [124] [235]) 



= 0 . 



( 2 ) 



Rules [7AC] 



[79A] = [179] [349] 
[47A] = [149] [347] 

[340] = [149] [234] 
[390] = [139] [249] 

[179] = -[167] [237] 
[349] = -[234] [367] 
[249] = -[234] [267] 
[139] = [123] [367] 

[237] = [123] [234] 

[167] = -[124] [136] 
[347] = [134] [234] 

[267] = -[123] [246] 

[136] = -[123] [134] 
[246] = -[124] [234] 



= [340][79A] + [390][47A] 

A 

= [179] [349] [340] + [149] [347] [390] 

0 

= [149] [179] [234] [349] + [149] [139] [249] [347] 



9 

= [234] [367] ( [167] [234] [237] - [123] [267] [347] ) 



7 

= [123] [234] (- [124] [136] [234] + [123] [134] [246]) 



6 

= 0 . 



Nondegeneracy condition: none. 

Example 11 (Nehring’s theorem, see also |^, Example 6.27). 

Free points: 1, 2, 3, 4. 

Semifree point: 5 on 12. 

Intersections: 



6 = 12 n 34, 7 = 13 n 24, 8 = 23 n 14, 9 = 13 n 58, 

0 = 23 n 69, A = 12 n 70, B = 13 n 8A, C = 23 n 6B. 



Conclusion: 5, 7, C are collinear. 
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Proof: 



[57C] 

- [235] [67B] - [237] [56B] 

- [136] [235] [78 A] - [13 A] [237] [568] 



[127] [136] [235] ]780] + [123] [170] [237] [568] 



[127] [237] (- [136] [235] ]689] + [123] [369] [568]) 



[136] [568] (- [138] [235] - [123] [358]) 



8 

= 0 . 



Nondegeneracy condition: none. 

Example 12 (See also [E], Example 7). 

Free points: 1, 2, 3, 4, 5, 6, 7, 8, 9. 

Semifree point: 0 on 19. 

Intersections: 

A = 13 0 24, B = 24n35, C = 35n46, 

E = 57n68, F = 68ni7, G = 17n28, 

Ai = 29nOH, Bi = 39n AAi, Cl = 49nBBi, 
El = 69nDDi, El = 79nEEi, Gi = 89nFFi. 

Conclusion: 0, G, Gi are collinear. 



D = 46 n 57, 

H = 28ni3, 
Di = 59nCCi 



Rules 



[67B] = [136] [78 A] 
[56B] = ]13A]]568] 

[78A] = - [127] [780] 
[13A] = - [123] [170] 

[780] = - [237] [689] 
[170] = [127] [369] 

[689] = [138] [568] 

[369] = - [136] [358] 

[138] = - [123] [134] 
[358] = [134] [235] 
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1 





5 



Fig. 12. Example 12. 



[OGGi] 

Gi 

= [8FFi][90G]-[80G][9FFi] 

Fi 

= -[79E][8FEi] [90G] + [79F] [80G] [9EEi] 

El 

= [6DDi] [79E] [89F] [90G]- [69D] [79F] [80G] [9DDi] 

Di 

= - [59C] [6DGi] [79E] [89F] [90G] + [59D] [69E] [79F] [80G] [9GCi] 
Gi 

= [4BBi] [59G] [69D] [79E] [89F] [90G] 

- [49C] [59D] [69E] [79F] [9BBi] [0G8] 

Bi 

= - [39A] [4BAi] [59G] [69D] [79E] [89F] [90G] 

+ [39B] [49G] [59D] [69E] [79F] [80G] [9 A Ai] 

Ai 

= [20H] [39 A] [49B] [59C] [69D] [79 E] [89F[ [90G] 

- [29 A[ [39B[ [49G] [59D] [69E] [79F] [80G] [90H] 

H 

= [123] [280[ [39A] [49B] [59C[ [09D[ [79E[ [89F[ [90G] 

+ [128] [29 A] [390] [39B] [49G] [59D] [69E] [79F] [80G] 

G 

= [128] [280] [123] [39 A] [49B] [59G] [69D] [790] [79E] [89F] 

+ [128] [280] [178] [29 A] [390] [39B] [49C] [SOD] [69E] [79F] 

F 

= - [178] [123] [39A] [49B] [59C] [689] [69D] [790] [79E] 

- [178] [179] [29A] [390] [39B] [49C] [SOD] [678] [69E] 



Proof: 

Rules 



[8FFi] 


= -]79E][8FEi] 


[9FFi] 


= -[79F][9EEi] 


[8FEi] 


= -]6DDi][89F] 


[9EEi] 


= -]69E][9DDi] 


[6DDi 


= -[59C][6DCi] 


[9DDi 


= -[59D][9CCi] 


[6DGi] 


= -[4BBi][69D] 


[9CCi] 


= -]49C][9BBi] 



[4BBi] = -]39A][4BAi] 
[9BBi] = -]39B][9AAi] 

[4BAi] = -[20H][49B] 
[9AAi] = -]29A][90H] 

[20H] = [123] [280] 

[90H] = -[128] [390] 

[90G] = [128] [790] 

[80G] = [178] [280] 

[89F] = -[178] [689] 

[79F] = -[179] [678] 
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E 

= - [678] [689] ]123] [39A] ]49B] [579] ]59C] [69D] [790] 

+ [678] [689] [179] [29 A] [390] [39B] [49C] [567] [59D] 

D 

= - [567] [579] [123] ]39A] [469] ]49B] ]59C] [790] 

- [567] [579] [179] [29A] [390] ]39B] [456] ]49C] 

C 

= - [456] [469] [123] [359] [39A] ]49B] [790] 

+ [456] [469] [179] [29 A] [345] [390] [39B] 

B 

= [345] [359] (- [123] [249] [39A] [790] - [179] [234] [29A] [390]) 
A 

= [123] [234] [249] (- [139] ]790] + ]179] [390]) 

= 0 . 

Nondegeneracy condition: none. 



]79E] = [579] [678] 

]69E] = -[567] [689] 



]69D] = [469] [567] 

]59D] = -[456] [579] 



]59C] = [359] [456] 

]49C] = - [345] [469] 



]49B] = [249] [345] 

]39B] = -[234] [359] 

[39 A] = [139] [234] 

[29 A] = -[123] [249] 




Fig. 13. Example 13. 



Example 13 (Saam’s theorem, see also |E], Example 6). 

Free points: 1, 2, 3, 4, 5, 6. 

Semifree point: 7 on 12. 

Intersections: 



8 = 13 0 24, 9 = 23 0 14, 0 = 15 0 46, 
A = 35 016, B = 13 0 67, C = 16 O 90, 
D = 15 O 8A, E = 12 O BC, F = 57 O 14. 



Conclusion: D, E, F are collinear. 
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Proof: 

Rules [DEF] 

F 

= [145][7DE]-[147][5DE] 

E 

= - [12D] [145] [7BC] - [147] [IBC] [25D] 

D 

= - [125] [145] [18 A] [7BC] + [125] [147] [IBC] [58 A] 

C 

= [145] [17B] [18A] [690]- [147] [16B] [190] [58A] 

B 

= - [167] [137] [145] [18 A] [690] + [167] [136] [147] [190] [58 A] 

A 

= - [135] [137] [145] [168] [690] + [136] [147] [156] [190] [358] 

0 

= - [156] [135] [137] [145] [168] [469] - [156] [136] [146] [147] [159] [358] 
9 

= [145] [146] (- [135] [137] [168] [234]- [123] [136] [147] [358]) 

8 

= [135] [136] [234] ([124] [137]- [123] [147]) 

= 0 . 

Nondegeneracy condition: none. 

Example 14 (See also 0, Example 6.190). Two doubly perspective triangles 
are also triply perspective. 

Free points: 1, 2, 3, 4, 5. 

Intersections: 6 = 12 fl 34, 7 = 24 iT 15, 8 = 13 fl 45, A = 56 iT 37. 
Conclusion: 2, 8, 9 are collinear. 

Proof: 



[7DE] = -]12D][7BC] 
[5DE] = [1BC][25D] 

[12D] = [125] [18A] 

[25D] = -[125][58A] 

[7BC] = -]17B][690] 
[IBC] = -[16B][190] 

[17B] = -[137] [167] 
[16B] = -[136] [167] 

[18A] = [135] [168] 

[58 A] = [156] [358] 

[690] = [156] [469] 

[190] = -[146] [159] 

[469] = [146] [234] 
[159] = [123] [145] 

[168] = -[124] [136] 
[358] = [135] [234] 



Rules [289] 



[268] = -[145] [236] 
[258] = -[135] [245] 

[357] = [135] [245] 
[367] = [145] [236] 



9 

= [268] [357] -[258] [367] 

8 

= -[145] [236] [357] + [135] [245] [367] 

7 

= 0 . 



Nondegeneracy condition: none. 
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Fig. 14. Example 14. 



Example 15 (See also 0, Example 6.26). In a hexagon whose vertices are 1, 
2, 3, 4, 5, 9, if both 39, 12, 45 and 19, 34, 25 are concurrent, then 14, 59, 
23 are concurrent. 

Free points: 1,2, 3, 4, 5. 

Intersections: 6 = 23 n 14, 7 = 12 n 45, 8 = 34 n 25, 9 = 37 n 18. 
Conclusion: 5, 6, 9 are collinear. 




Fig. 15. Example 15. 



[ 569 ] 

9 

= [ 178 ][ 356 ]-[ 138 ][ 567 ] 

8 

= [ 157 ] [ 234 ] [ 356 ] + [ 134 ] [ 235 ] [ 567 ] 

7 

= - [ 125 ] [ 145 ] [ 234 ] [ 356 ] + [ 125 ] [ 134 ] [ 235 ] [ 456 ] 



6 

= 0 . 



Proof: 

Rules 



[ 178 ] = [ 157 ] [ 234 ] 

[ 138 ] = -[ 134 ] [ 235 ] 

[ 157 ] = -[ 125 ] [ 145 ] 
[ 567 ] = [ 125 ] [ 456 ] 

[ 356 ] = [ 134 ] [ 235 ] 
[ 456 ] = [ 145 ] [ 234 ] 
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Nondegeneracy condition: none. 

Example 16 (Permutation theorem, see also PS!, Example 3). If 6, 7, 8, 9 are 
collinear, then there exits a projectivity between (8, 9, 7, 6) and (6, 7, 9, 8). 

Free points: 1, 2, 3, 4. 

Semifree point: 5 on 23. 

Intersections: 

6 = 12 0 34, 7=13 0 24, 8 = 15 0 67, 9 = 45 0 67, 0 = 23 0 48. 

Conclusion: 1, 9, 0 are collinear. 




Fig. 16. Example 16. 



Proof: 

[ 190 ] 

0 

= [ 189 ] [ 234 ] -[ 149 ] [ 238 ] 

9 

= [ 148 ] [ 234 ] [ 567 ] - [ 145 ] [ 238 ] [ 467 ] 

8 

= [ 145 ] [ 567 ] ([ 167 ] [ 234 ] + [ 123 ] [ 467 ]) 

7 

= - [ 124 ] [ 136 ] [ 234 ] + [ 123 ] [ 134 ] [ 246 ] 
6 

= 0 . 

Nondegeneracy condition: none. 



Rules 



[ 189 ] = [ 148 ] [ 567 ] 
[ 149 ] = [ 145 ] [ 467 ] 

[ 148 ] = [ 145 ] [ 167 ] 

[ 238 ] = -[ 123 ] [ 567 ] 

[ 167 ] = -[ 124 ] [ 136 ] 
[ 467 ] = [ 134 ] [ 246 ] 

[ 136 ] = -[ 123 ] [ 134 ] 
[ 246 ] = -[ 124 ] [ 234 ] 



Example 17 (Harmonic points, see also [5|, Example 6.236, and ca, Example 
4). If 6, 7, 8, B is a harmonic quadruple of points, then B is uniquely determined 

by 6, 7, 8. 

Free points: 1, 2, 3, 4, 5. 

Semifree point: 9 on 58. 
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Intersections: 

6 = 12 n 34, 7 = 23 n 14, 8 == 67 n 13, 
0 = 79 n 56, A = 69 n 57, B = 67 n 24. 

Conclusion: 0, A, B are collinear. 




Proof: 



[OAB] 

B 

= [246] [70A]- [247] [60 A] 

A 

= - [246] [570] [679] - [247] [567] [690] 

0 

= [567] [679] (- [246] [579] - [247] [569] ) 
9 [159] 

= ^=(-[246][578]-[247][568]) 
[158] 

8 

= - [567] [137] [246] - [567] [136] [247] 

7 

= [123] [134] [246] - [124] [136] [234] 

6 

= 0 . 



Nondegeneracy condition: [158] yf 0. 



Rules 



[70 A] = -[570] [679] 
[60A] = [567] [690] 

[570] = [567] [579] 

[690] = [569] [679] 

[158] [579] = [159] [578] 
[158] [569] = [159] [568] 

[578] = [137] [567] 

[568] = [136] [567] 

[137] = -[123] [134] 
[247] = [124] [234] 

[246] = -[124] [234] 
[136] = -[123] [134] 



Example 18 (See also |2|, Example 6.237, and US], Example 5). If the intersec- 
tions of five correponding sides of two complete quadrilaterals are on the same 
line I, then the remaining sides also meet in 1. 
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Free points: 1, 2, 3, 4, 5, 6. 

Semifree point: 7 on 12. 

Intersections: 

8 = 23 n 56, 9 = 13 n 78, 0 = 14 n 78, A = 24 n 78, 
B = 34 n 78, C = 57 n 69, D = 5A n 6B. 

Conclusion: 0, C, D are collinear. 




Fig. 18. Example 18. 

Proof: 

Rules [ocD] 

D 

= [50C][6AB]-[56B][0AC] 

C 

= [569] [570] [6 AB] + [56B] [579] [60 A] 

B 

= - [34A] [569] [570] [678] + [348] [567] [579] [60 A] 

A 

= [678] [234] [478] [569] [570] - [678] [240] [348] [567] [579] 

0 

= - [478] [147] [234] [569] [578] + [478] [124] [348] [567] [579] 
9 

= [567] [578] (- [138] [147] [234] - [124] [137] [348] ) 

8 

= [234] [356] (- [123] [147] + [124] [137] ) 

= 0 . 



[50C] = [569] [570] 

[OAC] = -[579] [60 A] 

[6AB] = -[34 A] [678] 
[56B] = [348] [567] 

[34A] = -[234] [478] 
[60A] = -[240] [678] 

[570] = -[147] [578] 
[240] = -[124] [478] 

[569] = [138] [567] 

[579] = -[137] [578] 

[138] = [123] [356] 

[348] = -[234] [356] 



Nondegeneracy condition: none. 
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Example 19 (Pascal’s theorem, see also 0, Example 6.390). 

Free points: 1,2, 3, 4, 5. 

Semifree point: 6 on 12. 

Intersections: 



7 = 34 n 15, 8 = 46 n 59, 2 = 16 n 39, 
A = 36 n 15, B = 45 n 69, 0 = 34 n 19. 

Conclusion: If 2, 7, 8 are collinear, so are 0, A, B. 

Reformulation of the theorem: 

Free points: 1, 2, 3, 4, 5. 

Semifree point: 6 on 12. 

Intersections: 



7 = 15 n 34, 8 = 27 n 46, 9 = 58 n 23, 
0 = 19 n 34, A = 36 n 15, B = 69 n 45. 

Conclusion: 0, A, B are collinear. 




Fig. 19. Example 19. 



Proof: 

Rules [OAB] 

B 

= [456] [90A]- [459] [60 A] 

A 

= - [136] [456] [590] + [156] [360] [459] 

0 

= [136] [159] [349] [456] - [139] [156] [346] [459] 

9 

= [235] [358] ([136] [158] [234] [456] + [123] [156] [346] [458]) 



[90 A] = -[136] [590] 
[60A] = -[156] [360] 

[590] = -[159] [349] 
[360] = -[139] [346] 

[159] = [158] [235] 

[349] = [234] [358] 

[459] = [235] [458] 

[139] = -[123] [358] 
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8 

= [456] [125] [136] [234] [467] + [456] [123] [156] [247] [346] 

7 

= [145] [234] [346] (- [125] [136] + [123] [156]) 

= 0 . 

Nondegeneracy condition in the proof of the reformulated theorem: none. 

Example 20 (See also 0, Example 6.28). 

Free points: 1, 2, 3, 4, 5, 6. 

Semifree points: 7 on 12, 8 on 13. 

Intersections: 



[158] = [125] [467] 
[458] = [247] [456] 

[467] = -[145] [346] 
[247] = [145] [234] 



9 = 14 n 56, 0 = 15 n 46, A = 37 n 28 , 
B = 34 n 89, C = 25 n 70, D = 58 n 30. 

Conclusion: A, C, D are collinear. 




Proof: 



Rules [ACD] 

D 

= [380] [5AC]- [350] [SAC] 

C 

= - [25 A] [380] [570] - [270] [350] [58 A] 

A 

= [258] [237] [380] [570] - [258] [270] [350] [378] 



[5 AC] = -[25 A] [570] 
[SAC] = [270] [58A] 

[25 A] = -[237] [258] 
[58A] = [258] [378] 
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[ 380 ] = -[ 140 ] [ 358 ] 
[ 570 ] = [ 157 ] [ 456 ] 

[ 350 ] = [ 135 ] [ 456 ] 

[ 270 ] = -[ 146 ] [ 257 ] 

[ 135 ] [ 378 ] = [ 137 ] [ 358 ] 



= - [ 146 ] [ 456 ] [ 157 ] [ 237 ] [ 358 ] + [ 146 ] [ 456 ] [ 135 ] [ 257 ] [ 378 ] 



8 

= [ 358 ](-[ 157 ][ 237 ] + [ 257 ][ 137 ]) 



= 0 . 



Nondegeneracy condition: [135] ^ 0. 

Example 21 (See also 0, Example 6.33). 

Free points: 3, 4, 6, 7. 

Intersections: 1 = 36 fi 47, 2 = 46 fl 37, 8 = 67 fi 34. 

Semifree points: 9 on 18, 0 on 12. 

Intersections: A = 28 fl 90, B = 36 fi 7A, C = 67 fl 39, 5 = 37 fi 60. 
Conclusion: B,C,5 are collinear. 

Reformulation of the theorem: 

Free points: 1, 2, 3, 4. 

Semifree points: 5 on 23, 9 on 12. 

Intersections: 

6 = 13 n 24, 7 = 23 n 14, 8 = 34 n 67, 0 = 56 n 18, 

A = 28 n 90, B = 7A n 13, C = 39 n 67. 



Conclusion: 5, B, C are collinear. 




Fig. 21. Example 21. 
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Proof: 



Rules 

[56B] = -[135][67A] 
[57B] = [137] [57A] 



[67A] = -[267] [890] 
[57A] = [290] [578] 



[890] = [189] [568] 
[290] = [156] [289] 



[123] [379] = 
[123] [189] = 
[123] [369] = 
[123] [289] = 



[137] [239] 
[128] [139] 
[139] [236] 
[128] [239] 



[568] = -[346] [567] 
[578] = -[347] [567] 



[267] = [124] [236] 
[347] = [134] [234] 



[346] = [134] [234] 

[156] = -[124] [135] 



[5BC] 

C 

= [379][56B]-[369][57B] 

B 

= - [135] [379] [67A] - [137] [369] [57A] 

A 

= [135] [267] [379] [890] - [137] [290] [369] [578] 

0 

= [135] [189] [267] [379] [568] - [137] [156] [289] [369] [578] 



g [128] [137] [139] [239] 



[123]^ 



-([135] [267][568] + [156][236][578]) 



= - [567] [135] [267] [346] - [567] [156] [236] [347] 



= - [236] [124] [135] [346] - [236] [134] [156] [234] 



= 0 . 



Nondegeneracy condition in the proof of the reformulated theorem: [123] ^ 0. 

Example 22 (Non-realizable 1 Os-configuration, see also fS), Example 9). 

Free points: 1, 2, 3, 4, 5. 

Semifree point: 6 on 12. 

Intersections: 7 = 23 n 14, 8 = 15 n 46, 9 = 25 n 36, 0 = 34 n 57. 
Conclusion: 8, 9, 0 are not collinear. 




Fig. 22. Example 22. 
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Proof: 

Rules [890] 

0 

= [357] [489] -[389] [457] 

9 

= [256] [348] [357] + [235] [368] [457] 

8 

= [346] [145] [256] [357] - [346] [156] [235] [457] 

7 

= [145] [235] [346] ([134] [256] - [156] [234]) 

= [125][145][235][346]^. 

Nondegeneracy conditions: [125], [145], [235], [346] yf 0. 

Example 23 (Fano’s axiom, see also [B|, p. 46, and P- 66). There is no 
complete quadrilateral whose three diagonal points are collinear. 

Free points: 1,2, 3, 4. 

Intersections: 5 = 12 fl 34, 6 = 23 fl 14, 7 = 13 fl 24. 

Conclusion : 5, 6, 7 are not collinear. 



[489] = [256] [348] 

[389] = -[235] [368] 

[348] = [145] [346] 

[368] = -[156] [346] 

[357] = [134] [235] 
[457] = [145] [234] 




Fig. 23. Example 23. 



Proof: 

Rules [567] 

7 

= [124][356] + [156][234] 

6 

= [124] [134] [235] + [123] [145] [234] 
5 

= -2[123][124][134][234]. 



[356] = [134] [235] 
[156] = [123] [145] 

[235] = -[123] [234] 
[145] = -[124] [134] 



Nondegeneracy conditions: [123], [124], [134], [234] ^ 0. 
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Abstract. A qubit is a two-state quantum system, in which one bit of 
binary information can be stored and recovered. A qubit differs from an 
ordinary bit in that it can exist in a complex linear combination of its two 
basis states, where combinations differing by a factor are identified. This 
projective line, in turn, can be regarded as an entity within a Clifford 
or geometric algebra, which endows it with both an algebraic structure 
and an interpretation as a Euclidean unit 2-sphere. Testing a qubit to 
see if it is in a basis state generally yields a random result, and attempts 
to explain this in terms of random variables parametrized by the points 
of the spheres of the individual qubits lead to contradictions. Geometric 
reasoning forces one to the conclusion that the parameter space is a 
tensor product of projective lines, and it is shown how this structure is 
contained in the tensor product of their geometric algebras. 



1 Introduction 

At its most fundamental level, quantum mechanics is pure geometry. In the non- 
relativistic case of interest here, it is in fact “just” three-dimensional Euclidean 
geometry. Although scientists in the field are basically aware of this fact, it is not 
widely known even by geometers. The following are four reasons for this state 
of affairs. 

The first is that the representations of the group of Euclidean motions with 
which quantum mechanics deals are not those that have been studied in “classi- 
cal” geometry; indeed, they are often over infinite dimensional complex function 
spaces! Nevertheless, as Felix Klein and Hermann Weyl have taught us, the el- 
ements of the carrier spaces of these representations are every bit as much a 
part of Euclidean geometry as lines and planes, even if they may be a great 
deal more difficult to visualize. It is in the interpretation of such mathematical 
structures, for example by means of suitable low-dimensional models, that ge- 
ometric concepts and methods can make important contributions to quantum 
mechanics. 

The second is the way in which these representations are combined when 
two or more quantum systems are merged into one, namely as the direct product 
of their constituent representations, rather than as the direct sum. Of course 
this is perfectly standard in mathematics today, but such composite representa- 
tions were seldom considered throughout the long history of geometry leading 
up to Klein, and they continue to receive relatively little attention from the 
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modern-day mathematicians working on classical geometry, probably because 
these representations become physically relevant (so far as is known) only at 
(sub)atomic scales. 

The third is that the actual transformations studied in quantum mechanics 
are motivated by physical considerations, and described in physical terms, which 
lie outside the purview of many mathematicians. This is true particularly of 
the interactions between different quantum systems, which induce nonclassical 
correlations between them and produce an “entangled” joint state that must 
be described by a nonfactorizable tensor. In addition to this language barrier, 
the notation used in quantum mechanics, especially its use of an uninterpreted 
imaginary unit, tends to obscure the underlying geometry. 

The fourth is the intrinsically stochastic nature of quantum measurements. 
Such measurements involve amplification of information about the state of a 
quantum system from the submicroscopic to the macroscopic level, a task not 
unlike (but considerably more drastic than) trying to pick up a live ant with 
a bulldozer. In general, such measurements destroy all correlations among the 
constituent parts of the system, which must therefore be inferred indirectly from 
the results of repeated measurements on ensembles of identically prepared sys- 
tems. The nondeterministic, and hence “nongeometric”, nature of this process 
is why Einstein so disliked quantum mechanics, and it is safe to say that more 
than half a century later nobody really understands “how it can be like that”. 
This is, once again, a matter of interpreting the mathematics, which nevertheless 
describes reality extremely well. 

The main message of this paper is that the computer-aided geometric reason- 
ing community is missing out on a great deal of excitement, and a potentially 
very appreciative audience, by limiting themselves largely to geometry as in- 
spired by the space in which classical physics is perceived to take placejj It will 
seek to illustrate this in the context of a particularly simple area of quantum me- 
chanics which is nonetheless of great contemporary interest, not just to physicists 
but also to computer scientists and growing number of mathematicians, namely 
“quantum computing” or (more generally) quantum information processing. The 
simplicity of this area is derived from that of the type of quantum systems it 
usually deals with, which are merely two-dimensional systems or qubits. 

In order to illuminate the geometric nature of qubits, the notation and lan- 
guage of the paper is derived from the most powerful tool for bridging the gap 
between the analytic and the synthetic that has been bequeathed to us by the 
grand masters of geometry from the 19th century, namely Grassmann’s “Cal- 
culus of Extension” as enlarged and streamlined by Hamilton, Clifford, Peano, 

^ Note that “classical” physics is usually considered to include special relativity — 
itself a very geometric subject with connections to non-Euclidean geometry. The 
book by Albert [Q contains a wonderful nontechnical introduction to the apparent 
paradoxes of quantum mechanics, whereas that by Peres |2| provides a more tech- 
nical and detailed account of how these paradoxes arise in trying to apply classical 
geometric reasoning to quantum systems. For an interesting interpretation of quan- 
tum mechanics within complex projective metric geometry, the reader may wish to 
consult m- 
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Cayley, Gibbs, Lipschitz, Chevalley, Cartan and Riesz together with (in more 
recent times) Hestenes, Sobczyk, Lounesto, and many others. Today it is most 
appropriately called (with apologies to E. Artin) geometric algebra. Among its 
many uses, it has become the basis of a number of symbolic algebra programs 
(as reviewed in j^) which, though intended primarily for either physical or ab- 
stract mathematical investigations, also have the potential to serve as potent 
aids to reasoning in the classical geometries. This potential has been discussed 
and illustrated in e.g. IH7l8tjll(illll2ll3l . and further progress along these lines 
is expected to be presented in this proceedings, as well. 



2 Quantum Information Processing 

We begin with a general, but necessarily rather brief, introduction to quan- 
tum information processing, using the language and notation of conventional 
quantum mechanics. Further details will be provided in a more geometric form 
over the following sections, and longer and gentler introductions are available in 



[E! 



Information, however it is conceived, exists only by virtue of being con- 
tained in the state of some physical system. Thus, although the information 
can have more than one meaning, it always has something to say about the 
state of the system it is stored in. Any N bits of binary information, for exam- 
ple, can be regarded as an abstract vector in Z^, though it may actually be the 
beads on an abacus. Similarly, quantum information can be regarded as a vector 
[^ij '02 7 • ■ • 7 0 at] G or as the state of a given multiparticle quantum system. 
In either case, it is the physics of its embodiment which determines what can be 
done with the information. 

In order to discuss these issues more concretely, a generic but simple model 
system is needed. This is a quantum computer, which encodes binary informa- 
tion in an ordered array of two-state quantum systems, or “qubits”. This en- 
coding is obtained by choosing a fixed pair of orthonormal vectors in the two- 
dimensional state space of each qubit, which are written in Dirac notation as 
I 0 ) = [1, 0], 1 1 ) = [0,1] and identified with binary 0, 1 respectively. The conju- 
gate transposes are denoted by ( 0 |, ( 1 1, respectively, and hence orthonormality 
means (OjO) = (Ijl) = 1 and (Ojl) = (IjO) = 0. By the previously men- 
tioned rules for merging multiple quantum systems into one, the state space of 
an Wqubit quantum computer is the (2'^)-dimensional tensor product of those 
of its qubits, which is spanned by the induced orthonormal basis 

= \S^)---\S^) = \S^---S^) ( 1 ) 

G {0,1}). These fV-qubit basis states may also be labeled by the 
integers k = 0, . . . , 2^ — 1 via their binary expansions, and denoted by \k). It 
should be understood that, although a classical analog computer can be built 
whose bits could be in arbitrary superpositions (linear combinations) 0o I 0 ) + 
0i| 1) (00 7 01 G C), its state space would be only 2Wdimensional over C. Thus 
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it is the exponential growth in the dimension of their state space which really 
distinguishes quantum computers from classical. 

The extra dimensions are due to the existence of nonfactorizable or entangled 
states, which exhibit a number of nonintuitive properties (cf. m)- These in turn 
are closely tied to the stochastic nature of quantum mechanical measurements. 
These measurements are quantized, so that measuring a qubit always yields one 
of the two possible outcomes 0 or 1, and leaves the system in the corresponding 
basis state | 0 ) or 1 1 ). Which outcome will be obtained can be predicted only if 
the system is already in one of the two basis states, whereas measurements on su- 
perpositions yield random outcomes. In particular, the information in entangled 
states is contained in the correlations between the outcomes of measurements on 
different qubits. As a result of this intrinsic randomness, even though it takes an 
exponentially increasing amount of classical information to specify the state of 
an iV-qubit quantum computer to any fixed precision, at most N bits of classical 
information can be stored in and recovered from it (hence the name, “qubit”) 

Having discussed the nonintuitive features of a quantum state, we now turn 
to the operations which can be performed on it, again in the context of an 
ideal quantum computer. According to Schrodinger’s equation, isolated quantum 
systems evolve in time by unitary transformations. The subgroup of the unitary 
group U(2'^) consisting of all permutations of the basis states | k) corresponds 
to the set of all invertible boolean transformations of Z^. It is known that, 
with at most a polynomial number of constant inputs and unused outputs, any 
boolean transformation can be embedded in an invertible one lEI, and hence 
a quantum computer is computationally universal. In fact, the entire unitary 
group is generated by local operations acting on at most two qubits at a time 
m, which is convenient because all elementary physical forces are two-particle 
interactions. 

Let us illustrate this with some specific “quantum logic gates” . The simplest 
is just the NOT of a qubit, which has the following matrix representation in the 
computational basis: 



7V|0) 



0 


1' 




'I 




'o' 


1 


0 




0 




1 



| 1 ) 



( 2 ) 



A one-bit gate without a classical counterpart is the Hadamard gate, which maps 
basis states to superpositions: 



ff|0) = 
H\l) = 



1 1 ■ 




■f' 


1 -1 




0 


1 1 ■ 




'o' 


1 -1 




1 



- 2 - 1 / 2 - 

2 - 1/2 

- 2 - 1/2 - 
- 2 - 1/2 




= (|0) + |l))/y2 

= (|0)-|1))/V2 



( 3 ) 

( 4 ) 



Note that both these operations are involutions, in that = 1. 

Interesting calculations require that operations on one qubit be conditional 
on the state of one or more others. Since we cannot read a qubit in a superposition 
without destroying that superposition, this conditionality must be built into a 
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unitary transformation which acts simultaneously on all the qubits. Clearly con- 
ventional logic gates like the AND are not possible, since these are not invertible 
(two inputs but only one output!). The most common example of a two-bit gate 
is the controlled NOT, or c-NOT, gate, which has the matrix representation 



^2|l 



10 0 0 
0 10 0 
0 0 0 1 ’ 
0 0 10 



and so acts upon the computational basis as follows: 

|00) = I 00), 5^1^ I 01) = I 01), 
5^1^110) = 111), 5^1^111) = |10) 



( 5 ) 



( 6 ) 



Thus the flips the second (right) qubit whenever the first is 1. This single 
two-bit gate, together with all possible one-bit unitary operations, is known to 
generate the entire unitary group U(2'^) [ I S] . The number of operations required, 
however, can be exponential in N. 

Much of the current interest in quantum computers is due to the spectacular 
discovery of a polynomial-time quantum algorithm for the integer factorization 
problem m- Since every known classical algorithm is exponential, this implies 
that quantum computers violate that cornerstone of computational complex- 
ity known as the strong Church-Turing thesis, which states that the distinction 
between exponential and polynomial-time algorithms does not depend on the 
underlying hardware. Such speed-ups rely upon the linearity of quantum evolu- 
tion, so that a unitary operation on the basis states operates simultaneously on 
all terms of a superposition, as follows: 

£^ELo^Cfc|A:) = J2k=o^ CkU\k) (7) 

This shows that a quantum computer offers a degree of parallelism which po- 
tentially grows exponentially with the size of the problem. The catch is that 
the above-mentioned limitations on the amount of information available from 
quantum measurements prevents us from simply reading out the results (which 
would require exponential time and memory anyway). The trick, therefore, is to 
find an additional quantum operation that maps the final superposition back to 
a basis state which reveals the desired answer, without knowledge of the super- 
position. Such operations, generally some form of Fourier transformation, have 
been discovered for only a very few problems to date, and in particular, quantum 
computers are unlikely to be a panacea for N P-complete problems m- 

Having described the operational features of quantum computers, we turn 
our attention to their underlying geometric structure. 



3 The Pauli Algebra 

The state space of a single qubit, being a two-dimensional vector space over C, 
appears very different from physical space. A relation between the two is obtained 
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Fig. 1. The Bloch vector of a qubit being subjected to a continuous rotation on the 
Riemann (unit) sphere, starting from its reference position along the vertical z-axis. 
The spirals in the complex xy-plane are its stereographic projection (outside the sphere) 
together with that of its antipode (inside), as indicated by the dashed lines from the 
north pole. This action of SO(3) on the Bloch vector is induced under stereographic 
projection by certain Mobius transformations (or homographies) of the complex plane, 
which in turn are induced by the standard representation of the special unitary group 
SU(2) under taking ratios of the complex coordinates V'l/V’o of qubit state vectors. 
This mapping is a two-to-one homomorphism of SU(2) onto SO(3), in accord with 
the well-known Lie algebra relationship between these groups: SU(2) is the universal 
(i.e. simply connected) covering group of SO(3). On the projective line given by the 
ratio tpi/'>Po, the antipode corresponds to the harmonic conjugate, with respect to the 
projective points of the north and south poles, of the Bloch vector’s projective point. 



by viewing the components of a general state vector \tp) = '0o I 0 ) + V’l I 1 ) tts the 
Cayley-Klein parameters for a three-dimensional Euclidean rotation P2], which 
are given in terms of the Euler angles by 

ipo = cos(t^/2) , i/’i = — tsin(r?/2) (8) 

(wherein = —1). These parameters determine an element of the two-fold 
universal covering group SU(2) of S0(3), namely 






fpo -i’t 
,^i i’o , 



G SU(2) , 



( 9 ) 



and hence are determined by the rotation up to sign. This identification makes it 
possible to map the state of the qubit to a three-dimensional unit vector, which 
is obtained from a given reference vector by applying the rotation to it. Under 
this mapping, the states | 0 ) and | 1 ) coincide with the reference vector and its 
negative, while the physically irrelevant net phase exp(t^) corresponds to the 
angle of rotation about the reference vector. 

Assuming that the reference is a unit vector along the z-coordinate axis, the 
transformed vector, called the Bloch vector, is given in terms of an orthonormal 
basis (Tx , <Ty , <Tz by 



sin(i?) cos(:p)crx + sin(i?) sin((/j)<Ty -|- cos(^?)<Tz , 



(10) 
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SO its polar coordinates are Euler angles. This “classical” model of a qubit’s state 
which corresponds to the stereographic projection of 'i/'i/V’o onto the Riemann 
sphere (see Fig. ifl has a number of interesting implications. First, it associates 
the computational basis | 0), | 1) with a direction in physical space, which (in 
any physical realization of qubits) is determined by the measurement apparatus. 
Because physical rotations of the apparatus induce passive transformations of 
the underlying state space, the question of whether a qubit is in a superposition 
state or a basis state is geometrically meaningless; whether or not they are 
entangled, however, turns out to be basis independent. More importantly for the 
present purposes, the model enables the standard matrix formulation of quantum 
mechanics to be embedded in the natural algebraic structure of a metric vector 
space, namely its Clifford or geometric algebra (cf. ^3E32S|221). 

The geometric algebra of a three-dimensional Euclidean vector space, also 
called the Pauli algebra G3 , is the associative algebra generated by a basis subject 
to the relations -I- 2 {gL,v G {x, y,z}). These relations readily 

imply that the Euclidean inner product is given by the symmetric part of the 
geometric product, a b = (ab+ba) /2, while the antisymmetric part corresponds 
to Grassmann’s outer product a A 6 = {ab — ba)/2. A linear basis for this eight- 
dimensional algebra is given by: 




Oxfy, OiCTx, CTyCTz 
axCTyCTz 



(scalars, 0-dimensional) 
(vectors, 1-dimensional) 

(bi vectors, 2-dimensional) 
(pseudo-scalars, 3-dimensional) 



( 11 ) 



In particular, the unit pseudo-scalar l = ayCTyCTz is easily seen to square to — 1 
and commute with everything in the algebra, thereby providing a geometric 
interpretation for the imaginary unit of qubits (and many other physical entities) . 
Just as a real number can be interpreted as either a magnitude (e.g. length = 2) 
or as an operator (e.g. scale by 2), i can be interpreted as either the oriented 
volume element of space, or as an operator that maps vectors to their orthogonal 
bivectors. The latter shows that the outer product of two vectors a, b is related 
to their cross product by: 

a Ab = b{a x b) (12) 

In addition to its many applications to classical geometry and physics 
the Pauli algebra provides a concise mathematical encoding of the physics of 
qubits (be they nuclear spins, photon polarizations, electronic states of atoms, 
etc.), as follows. The even subalgebra is that generated by the products of 
even numbers of vectors, namely by the identity together with the basis bivectors. 
Its multiplicative subgroup of unit norm elements is isomorphic to SU(2), which 
in turn is isomorphic to the multiplicative group of unit quaternions. A time- 

^ For many further illustrations of these beautiful morphisms between seemingly 
different geometries, the reader is referred to the delightful book “Visual Complex 
Analysis” by T. Needham pT| . 
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dependent rotation of a vector v is given by its conjugate UvU ^ with a one- 
parameter subgroup of , namely 

U = exp(—uvtu/2) = cos(w</2) — msin(wt/2) , (13) 



where w is the angular velocity and u a unit vector on the axis of rotation. 
The inverse U~^ = exp{Lujtu/2) = IJ is obtained by reversing the order of the 
factors in the terms of an expansion of U relative to the above basis, thereby 
changing the sign of its bi vector part. In most physical realizations of qubits, 
H = LOU is the Hamiltonian “operator” for the interaction of the qubit with a 
constant external field which induces the rotation. 

Elementary idempotents of the form = {l + v)/2 with = 1 play several 
important roles in the algebra and the physics. First, these idempotents describe 
a qubit’s state as the sum of a scalar (1/2) and the vector {v/2) obtained by 
applying a rotation S' G to the reference vector (conventionally taken to be 
cg/2), 

= i(l + u) = «l^i(l + cr,)t^ = ^E+^. (14) 

The reference idempotent and its complement will henceforth be written as 
E± = E±a-^, so that E+ GG |0)(0| and E^ O |1)(1| (where O indicates 
a correspondence between geometric and coordinate-based objects). Although 
the scalar part may seem superfluous, it will be seen shortly that it provides a 
normalization factor needed to describe the statistics of ensembles of qubits. 

Second, right-multiplication by the reference idempotent projects the algebra 
onto a left-ideal with half as many (i.e. four) degrees of freedom, on which the 
even subalgebra acts from the left to give a representation of K* © SU(2). This 
allows the elements of the left-ideal to be interpreted as qubit state vectors, 
since they transform in the same way. Explicitly, if we choose a representation 
in which cr^ and hence £1+ is diagonal, we have: 



V’o 


-r: 




'1 o' 




V’o 0 




V’S _ 




0 0 




.V’l 



(V'o|0)+V’i|l))(0| (15) 



It follows that 'PE^^E^ — tPE^'ipQ is the inverse of our mapping from qubit 
state vectors to spatial vectors (up to phase and normalization) . 

Third, E.^ is the operator for a measurement that yields 1 if the qubit is 
in the state E„ and 0 if it is in the orthogonal state E^^, . In accord with the 
properties of quantum measurements introduced previously, when applied to a 
state Ej^, this measurement irreversibly “collapses” it onto one of the two states 
E ±^ , each with probability equal to 



2{E^E±^)^ = l{l±wv) , (16) 

where (_)q denotes the scalar part (or half the trace in the above matrix repre- 
sentation). Thus we see that, curiously enough, the geometric relations among 
Euclidean vectors representing states and operators are what determines the 
statistics of quantum measurements. This equivalence will now be extended to 




236 Timothy F. Havel 



ensembles of qubits, wherein the qubit states’ statistical frequencies represent a 
probability density function over the state space. 

In the matrix formulation of a qubit mechanics |2] , the expectation value of 
a measurement is given by a quadratic form in the state vector components, e.g. 

= tr(,Ej^)(V^|) o 2{E,E^), = ^{1 + v w) , (17) 

which is an ajfine form in the Bloch vector w. This allows us to write the average 
of the expectation value over a probabilistic ensemble { Pi , | V’z ) } {Pi ^ 
Y^iPi = 1) of qubits as 

YiP^{'<P^\E^\'^i) = ‘^{EvY^PiE^i)o = + ■ ( 18 ) 

It follows that the average of any test over the ensemble is determined by a Bloch 
vector rjw = YiPi'^i of length 77 < 1, or equivalently, by the corresponding 
density operator p = ^{1 + rjw) . The “coherence” of the ensemble 77 is related to 
the average of the density operator versus itself by 77 ^ = 4( )q — 1. An ensemble 
is called pure when 77 = 1, meaning that p O \il)){'ip\ for a single state vector 
1 7/1), and mixed otherwise. 

4 Multi-qubit Systems 

Extending these geometric interpretations to multi-qubit systems necessitates 
taking the A^-fold tensor power of C/3 0 The subalgebra generated by the op- 
erators for any one qubit is thus a copy of the Pauli algebra as above, while 
the operators for different qubits commute freely. Thus an arbitrary element of 
the algebra, denoted by can be written as a linear combination of produet 

operators, 

“a* > (^^) 

/tG{0,x,y,z}^ 

where the superscripts are qubit indices (written here arbitrarily in increasing 
order), and for notational convenience erg = 1. The coefficients are hyper- 
complex numbers in the center of the algebra of the form 

^ • • • 7 ,“^- , ( 20 ) 

where £ M and t'* is the unit pseudo-scalar of the (7-th qubit {q = 1, ... ,N). 

It follows that Gf^ has dimension 2^^ as a linear space over K. In the 
conventional quantum mechanics of qubits, however, one works in the algebra 
of 2^ X 2^ matrices over C, which has real dimension 2^^+^. These apparently 
superfluous degrees of freedom arise from the fact that Gf^ contains a different 

® A physical justification for this may be found in the fact that this algebra is obtained 
from the geometric algebra of a direct sum of N copies of Minkowski space-time by 
choosing an inertial frame in each E2E1- 
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unit pseudo-scalar for each qubit q = 1, . . . , TV. They can be projected out by 
working in the principal ideal generated by the so-called correlator idempotent: 

C = (21) 

It is easily seen that = —C for all p,q G {1, . . . , N}, which enables us 

to simply drop the qubit labels on the t’s and use a single imaginary unit as 
in conventional quantum mechanics. Also, since C commutes with the entire 
algebra, it too can be dropped for most purposes, as will be done from here on. 

The resulting algebra Gf^{C) is isomorphic to the algebra of all 2^ x 2^ 
matrices over C, but the entities within it no longer depend on an arbitrary 
choice of coordinate frames. As with a single qubit, there are two ways to view 
the states of an TV-qubit system E3- The first is as a left-ideal in the product 
of the even subalgebras (03^)'^^(C'), namely where 

11^^11 = 1, E+ = E\---E^, ^ i(l + ^9) (22) 

{q G {1, . . . , TV}). The second is as the corresponding density operator: 

xp = ^E+^ (23) 

The density operator formalism has the advantages that it is closer in most re- 
spects to conventional quantum mechanics, that it explicitly exhibits the duality 
between states and their transformations (namely multiplication by l), and that 
it can be used to describe statistical ensembles as well as subsystems of entan- 
gled quantum systems (see below). For these reasons we will concentrate on the 
density operator formulation in the following. 

Let us now consider how to describe logical operations in the algebra, and 
their geometric interpretations (cf. E3). The simplest such gate is the NOT of 
a single qubit TV = —tcr^, which is just a rotation by tt about x. This acts on 
the initial density operator £1+ o | 0 )( 0 | as: 

-L(Ty,E+L(Ty, = i(l -L O-xCr^O-x) = i(l - (<Tx)^(Tz) = E_ O- |1)(1| (24) 

That is to say, —lct^ = exp(— t<Tx 7 r/ 2 ) both generates and is itself the desired 
logic gate. Similarly, the Hadamard gate is a rotation by tt about the axis 

(<Tx -f ct^)/a/ 2 : 

H = exp t 7 r(<Tx -f <Tz)/\/ 8 ^ = — l{ct^ + cr^) j \/2 (25) 

We leave it as an exercise to show that this acts on the basis states E± as 
previously described using matrices. 
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Finally, the c-NOT gate is given in coordinate-free form a^: 

521 1 = El + (26) 

This can also be written in exponential form as = eyi]y{—LE^(Tl'K /2) . It is 

easily shown that it acts on the basis states as 

5 ' 2 |i£;^i £;2 ^ 2 |i ^ eIe1^2 (e\e^e{±l}), (27) 

so it takes the NOT of the second qubit whenever the first is E^ . The c-NOT gate 
thus illustrates yet another use for the elementary idempotents: they describe 
the conditionality of operations. Alternatively, from a geometric standpoint, a 
c-NOT gate is a rotation by tt in the left-ideal defined by E^_ only. 

The following are some physically significant properties of multi-qubit density 
operators: 

— A density operator p is pure if it is idempotent, meaning (p)^ = p, and 
mixed otherwise. 

— It is factorizable if p = p^ • • • p^, where each p'^ {q = 1, . . . , N) can be 
expressed in terms of operators acting on just the q-th qubit, and correlated 
if it is not factorizable. 

— It is separable if there exists an ensemble for it whose individual states are 
factorizable (unentangled), meaning 

p = EfcPfc I ,^8) 

Note that a nonseparable density operator (or ensemble) is necessarily correlated, 
and that the converse holds if p is pure. It is in general difficult to determine 
if a given density operator is factorizable or not, and even more difficult to 
determine if it is separable. These problems have been completely solved only 
in the case of two qubits, thanks in large part to the Schmidt decomposition of 
general bipartite quantum systems. The Schmidt decomposition of a two-qubit 
pure state is 

lx) = , (29) 

and is easily derived from the singular value decomposition of the 2x2 matrix 
obtained from the four entries of the vector | y). A pure state is unentangled if 
and only if one of the two real coefficients Ci -k cl = 1 vanishes. 

* The astute reader will have observed that the phase of the states transformed by 
this operator differs from that of the matrix Eq. Q; since this has no effect upon 
the basis states, the c-NOT given here is an equally valid extension of the classical 
gate to superpositions. 
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The problem of interpreting multi-qubit density operators geometrically is 
perhaps even less well-understood, so we restrict ourselves to the two qubit case, 
and consider the product operator expansion of a general density operator: 

P — ^M6{x.y.z} Si^e{x,y,z} 

X//jg{x,y,z} X/i/g{x,y,z} 

The orthogonality of the product operators implies that coefficients in this ex- 
pansion are the ensemble average expectation values of the corresponding prod- 
uct operator. The first term, of course, transforms as a scalar, while each of 
the summations over the a’s transform as vectors. The /3’s are the correlations 
among the components of these vectors. The summation over the /3’s transforms 
as a rank 2 tensor, which decomposes into three orbits under the simultaneous 
action of the rotation group on both qubits together: 

a scalar, + (3yy + (3^^){cT^cr‘^ + -b ; 

a vector, i(/3yx - ^^y)((Tlcr^ - crjcr^) + i(/?xz - /?zx)(cr,!<7'2 - cr^a^) 

+ \{Pzy - l3yz){(rj(T^ - (T^CTy) 

(since this does not change sign under inversion in the origin, it would be more 
accurate to say it transforms as a &ivector!); 

and a symmetric traceless rank 2 tensor, 

(|/3xx - ^Pyy - + (-|/3xx + |/3yy ~ ^Pzz)cTy(T^ 

+ (-|/3xx - |/3yy + l0zz)a-^(T^ + ^(/^xy + Pyx){eT^O-y + (Ty (T^) 

+ \{Pzz + + Cr^^Cr2) -b i(/3y, -b /3zy)((Tylcr2 -b (T^(T^). 



The vanishing of the trace of this tensor implies that the corresponding quadratic 
form is a second-order solid harmonic. A basis for such an irreducible represen- 
tation of the rotation group acting identically on all qubits together constitutes 
a system of covariants for the rotation group; an invariant is a system consisting 
of a single covariant. 

The group of interest in qubit geometry is actually SU(2 D (SU(2))®^, 
the (2^)-fold cover of (S0(3))®^. This is because entanglement can only be 
created by rotating two subsystems with respect to one another in the larger 
space that is allowed by entanglement, whereas local operations in (SU(2))®^ 
are generated by the rotations of single qubits. In any physical realization of 
qubits, local operations correspond to interactions of the qubits with external 
fields, whereas entangling operations involve interactions between qubits. The 
entangling operations convert the representations of (50(3))®'^ on density op- 
erators into an infinite family of similar representations, which can in principle 
be classified by the values of the invariants. It is known, for example, that the 
invariants of a two-qubit density operator are generated algebraically by only 
ten fundamental invariants, and although such a minimal system has yet to be 
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exhibited, 21 linearly independent invariants have been found using computer 
algebra methods m- Also in the case of two qubits, a geometric interpretation 
of the Schmidt decomposition has been given m 

Up to this point, we have assumed we are dealing with a quantum computer 
that is perfectly isolated from its environment, so that a pure ensemble evolves 
coherently into another pure ensemble. The main reason why a quantum com- 
puter of any substantial complexity has yet to be built is that it is very difficult 
to achieve a sufficient degree of isolation in the laboratory, while still being able 
to interact with the system as needed to perform logical operations on its state. 
The interactions between a system and its environment generally result in their 
mutual entanglement, and since the environment is (by definition) unobservable, 
this in turn results in a loss of accessible information on the state of the system, 
which is otherwise known as decoherence. The system may then be described 
conceptually by an ensemble of quantum states which reproduces the statistics 
of measurements on it, or more compactly, by the density operator of such an 
ensemble. 

To illustrate this, suppose that the “environment” itself consists of a single 
qubit, which interacts with a second “system” qubit so as to produce one of the 
so-called Bell states: 

^2|ig.5)V4|00) = 5^|1(|00)- |10))/V2 

= (|00)-|ll))/y/2 = !<(._) . 

The corresponding density operator is 

I «<>-)(</'- I ^ P = 3 +<^y<^y , 

which exhibits the correlations between the qubits explicitly. None of the terms of 
this expression can be observed without performing simultaneous measurements 
on both qubits save for the identity 1, and hence the density operator of the 
system qubit #2 alone is simply 

= i o 1(|0)(0| + |1)(1|) . (33) 

This in turn corresponds to a completely mixed ensemble wherein half the qubits 
are in each of the states | 0 ) and | 1 ), meaning that all information about the state 
of the system has been lost to the environment. The corresponding mathematical 
operation is called the partial trace over the first qubit, which means dropping 
all terms from the product operator expansion of p depending on that qubit, and 
renormalizing by multiplying the remainder by two. It should be noted, however, 
that even if the environment qubit is also measured, the state | 4>- ) cannot be 
distinguished from its complementary Bell state \4>+) = (|00) -I- |ll))/\/2. 
In other words, one of the two bits of information contained in this system 
is fundamentally hidden by its entanglement, and can be recovered only by 
disentangling them. 



(31) 

(32) 
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5 An Entanglement Paradox 



On adding a third qubit to the Bell state | (/)_ ) = | 4>^ ) above and performing 
another c-NOT on it, we get: 



= 5^1^(1000) - 1 110))/V2 

= (|000) - |lll))/\/2 = I GHZ) 



(34) 



This is the highly entangled Greenberger-Horne-Zeilinger state PI. the density 
operator of which is | GHZ ) ( GHZ | o Pqhz = 






cr, (T, 



CT (T-, — <J^ CT^ CT 



^1^2 _3 

<Ty <Ty (T^ 



^1^2 _3 
(j^cT^cr^ 



’’xCTyO-y ) • (35) 



It will now be shown that the statistics of measurements performed on an en- 
semble of three-qubit systems, all prepared in the GHZ state as described above, 
cannot be reproduced by any probabilistic ensemble of systems wherein the out- 
comes of the measurements are determined by the prior states of the individual 
qubits PI- This shows, in turn, that these statistics must be determined by 
parameters which are pertinent to more than one qubit. If one accepts that the 
Bloch vector fully characterizes the statistics of measurements on single qubits 
and seeks to maintain the linearity of quantum mechanics, then the most natural 
such parameter space is the full tensor product of the Bloch vectors of the three 
qubits. Empirically, it has been found that these parameters are also sufficient 
to fully characterize the statistics. 

First, let us consider the results of measuring the individual qubits along z, 
namely cr^ ^ , <r^ . These form what is called a complete system of commuting ob- 

servables, meaning that their simultaneous eigenstates are uniquely determined 
by the outcomes of measuring them (i.e. by their eigenvalues ±1). Hence knowl- 
edge of these outcomes is sufficient to predict the result of measuring any other 
commuting observable. Since taking the partial trace over any two of the qubits 
in Pghz yields the density operator 1/2, the outcomes of these measurements are 
all completely random (i.e. have a 50% chance of being either ±1). Now consider 
the three product observables {qi ^ q 2 € {1, 2, 3}), the outcomes of which 

are 1 when both Bloch vectors come out parallel along z, and —1 if they come 
out antiparallel. Since the terms all occur in the above expansion of Pqhz 

with a coefficient of 1/8, the expectation values are 2^ ( Pqhz )o “ ^ 

all these observables, which (since their outcomes must be ±1) shows that their 
outcomes on any given system are 1 with probability one. Thus any two qubits 
must always yield the same value when measured, and this further implies that 
the outcomes are also the same when all three qubits are measured. It follows 
that the outcomes of measurements of the qubits in the cr^ computational basis 
are compatible with a “classical” ensemble, which consists of an equal mixture 
of systems in each of which all three qubits are either | 0 ) or 1 1 ) . This might 
lead one to believe that the state | GHZ ) is unentangled, but nothing could be 
further from the truth! 
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Now let us consider the results of measurements along the x and y axes, 
noting that any three of the observables cr^ or cr^ with distinct qubit indices 
q = 1,2,3 form a complete system of commuting observables. Once again, the 
result of measuring any one of these observables is completely random, but now 
the result of measuring any two of them is likewise completely random (i.e. all 
four possibilities occur with equal frequency) . To show that certain combinations 
of all three outcomes are nonrandom, we observe that the three product operators 



A — d-A R — 3 

-/x CTy ^ ±J 



y ’ 



U = CTy CTy (T^ 



(36) 



form a complete system of commuting observables, so that the fourth commuting 
observable 

D = (37) 



can be predicted when the outcomes of the first three are known. 

Now, it follows from their coefficients in the above expansion of Pqhz that 
the expectation values of A, B and C are 1, while that of D is —1. Therefore, 
if we measure three commuting observables of the form , cr ^'^ , cr® [qi yf g 2 
<73 7^ 91)1 we can be sure that either all three outcomes are 1, or else exactly 
two are —1. If . . . , My are the predetermined outcomes of the individual 

measurements on some arbitrary system in the GHZ state, the expectation values 
of A, B and C show that 



= 1 . (38) 

Thus the product of all three triples of outcomes is likewise 



1 = , (39) 

since (M^)^ = (±1)^ = 1 for q = 1,2,3. But this contradicts the fact that the 
expectation value of D is —1, since that implies the product of the three 
should be —1 with probability one. For those who would like a more detailed 
proof, the following table shows that all combinations of signs for the x (rows) 
and y (columns) outcomes are excluded by the expectation values of at least one 
of the observables A through D. 

At this point, one might begin to doubt that quantum mechanics is a self- 
consistent theory. The thing that saves it is the Heisenberg uncertainty principle, 
which forbids us from making simultaneous measurements of noncommuting op- 
erators such as and er^. In the above we start from an ensemble of triples 
of qubits, each of which is in the quantum state | GHZ ), choose one of the four 
triples of measurements given in Eqs. m, (El at random, and perform those 
measurements on the qubits one-at-a-time (as must be done if they are widely 
separated in space). Although each qubit measurement produces a random out- 
come, they are perfectly correlated in the sense that the products of the three 
outcomes for A, B, C are always 1 while that for D is always —I. Because of 
the Heisenberg uncertainty principle, we are never able to actually measure more 
than one of these four triples on any single system, but since the correlations for 
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any given triple are always the same, no matter which member of the ensemble 
we take, we assume (quite reasonably) that it does not matter. In other words, 
we assume that if we had measured D rather than A (say) in any given case, 
we would have seen the same correlations as we always had before (of course we 
could measure the A triple, and then the £>, but then the outcome of D would 
show no correlations, according to Heisenberg). This is known as contrafactual 
reasoning. The amazing thing is that this “cosmic censorship” corresponds to 
the orthogonality of two vectors cr^ and ar^ in the space of a single qubit! 

This paradox, which has been tested experimentally EiEa, indicates that 
qubit geometry must be based on a tensor product space, rather than a direct 
sum of the underlying state spaces as in the geometry of classical physics, be- 
cause only a tensor product space is big enough to independently specify all the 
correlations. The study of such spaces provides geometers with fertile new ter- 
ritory for their methods, which despite considerable study among physicists has 
not yet been explored with anything approaching the thoroughness of classical 
geometry. Geometric algebra is applicable in both domains, and computational 
methods for geometric reasoning based on geometric algebra (see Introduction) 
should be extended to the study of multi-qubit geometry, with the goal of gaining 
deeper insight into the structure of entanglement 

Finally, the construction of large-scale quantum computers is currently a 
matter of intensive research in many laboratories Although the primary 

motivation for this work is their unique digital information processing capabili- 
ties, they will also be extremely useful as analog devices which can simulate other 
quantum systems (see e.g. [L3til37|3Sj ) . This also makes it possible, however, for a 
quantum computer to operate directly on tensor products of geometric algebras, 
as well as (by computing in suitable homomorphic images) many other algebraic 
structures. This may enable quantum computers (when one is eventually built!) 
to solve algebraic and geometric problems that are far beyond the reach of any 
possible computer based on classical physics alone. 
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Abstract. This paper describes ongoing work in our formal investiga- 
tion of some of the concepts and properties that arise when infinitesimal 
notions are introduced in a geometry theory. An algebraic geometry the- 
ory is developed in the theorem prover Isabelle using hyperreal vectors. 
We follow a strictly definitional approach and build our theory of vectors 
within the nonstandard analysis (NSA) framework developed in Isabelle. 
We show how this theory can be used to give intuitive, yet rigorous, 
nonstandard proofs of standard geometric theorems through the use of 
inhnitesimal and infinite geometric quantities. 



1 Introduction 

In our previous work on the mechanization of Newton’s Principia^ we introduced, 
through a combination of techniques from geometry theorem proving (GTP) and 
nonstandard analysis (NSA), the notion of an infinitesimal geometry in which 
quantities can be infinitely small wm- The main aim was to capture and 
mechanize the limit or ultimate notions used by Newton in his proofs, while 
respecting as much as possible his original geometric arguments. 

Our formalization task, within the interactive framework of Isabelle, was 
made possible through the use of concepts from powerful — yet geometrically 
intuitive — GTP techniques known as the signed area and full-angle methods 
m- These methods were highly adequate to our goals as they provided us 
with lemmas powerful enough to prove the results we wanted but also used 
geometric notions such as areas and ratios of segments that were directly relevant 
to Newton’s proofs. 

In the current work, however, we depart to some extent from the framework 
already established in Isabelle for geometry. Our aim, now, is to formally explore 
the properties of the infinitesimal geometry theory developed in Isabelle. To this 
end, we formulate an alternative treatment of geometry based on the notions 
of hyperreal vectors. We want to provide a rigorous yet powerful theory that 
can capture formally the properties of our geometry, as well as provide a secure 
foundations for our previous work. 

Moreover, the approach we describe in this paper also differs from that pre- 
viously adopted in that it is fully definitional. In other words, we now formally 
define and derive all mathematical notions rather than postulate any of them. 
This approach guarantees consistency, which cannot be ensured when axioms 
are introduced (see Section mi for a brief overview of this methodology). 
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In what follows, we first offer a brief review of some basic notions from 
nonstandard analysis that will prove useful to our discussion (Section EJ. Next, 
we give an overview of the vector theory developed in Isabelle (Section EJ • In 
particular, we briefly review the vector algebra, the vectorial definitions used for 
familiar geometric properties, and some of the infinitesimal geometry theorems 
that follow. We then outline some of the novel infinitesimal geometric concepts 
formalized in the work so far (Section EJ. We then describe a new approach, 
based on nonstandard methods, that can be used for proving familiar (standard) 
geometry theorems (Section E|- Finally, we outline some of the further work 
currently in the pipeline (Section EJ and share some of the conclusions we have 
reached so far (Section 0). 

2 A Few Concepts from Nonstandard Analysis 

The definitions below describe the various types of numbers that exist in the 
nonstandard universe introduced by nonstandard analysis. They provide some 
of the basic NSA concepts needed to follow this paper. 

Definition 1. In an ordered field extension IR* 3 IR, an element x G IR* is 
said to be an infinitesimal if |a;| < r for all positive r G IR; finite if |a:| < r for 
some r G IR; infinite if \x\ > r for oZZ r G IR. 

The extended, richer number system IR* is known as the hyperreals P3] and 
has been developed in Isabelle through purely definitional means using the so- 
called ultrapower construction H3|. 

Definition 2. x,y G IR* are said to be infinitely close, x ~ y if x — y is in- 
finitesimal. 

This is an important equivalence relation that is crucial to NSA and also 
to both our past and current work. We present an extension of this relation 
to hyperreal vectors in Section roi and use it to define the various properties 
investigated by this work. 



3 A Mechanized Theory of Hyperreal Vectors 



Apart from using an interactive (hence slower) approach to GTP, the current 
work also differs from the traditional automated approach by residing within 
the higher-order logic framework of Isabelle/HOL [inj. One of the main reasons 
for choosing Isabelle/HOL is that it provides a rigorous approach to the for- 
malization of the infinitesimal — a notoriously difficult task. The suitability of 
Isabelle/HOL for our formalization stems mostly from the benefits gained by 
adopting the so-called HOL methodology. This is briefly examined next. 
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3.1 The HOL Methodology 

The HOL methodology, which derives from work done by Gordon in the HOL88 
theorem prover ini> admits only conservative extensions to a theory. This means, 
as we already mentioned in the introduction, defining and deriving the required 
mathematical notions rather than postulating them. The definitional approach 
of HOL requires that assertions are proved about some model instead of being 
postulated. Such a rigorous definitional extension guarantees consistency, which 
cannot be ensured when axioms are introduced. As pointed out by Harrison 
H3, such an approach provides a simple logical basis that can be seen to be 
correct once and for all. With regards to the foundations of infinitesimals, the 
definitional approach is certainly advisable when one considers the numerous 
inconsistent axiomatizations that have been proposed in the past [Z|. Of course, 
care still needs to be exercised, as a wrong definition will almost certainly yield 
the wrong properties. 

The way to proceed is thus very much in the spirit of Hilbert’s Grundlagen, 
namely to show that there is a number system (say a field such as the hyperreals) 
associated with the geometry and reducing consistency of Isabelle’s geometric 
theory to that of hyperreal arithmetic. This is achieved when working within the 
context of Isabelle/HOL, by developing a geometry theory according to the HOL- 
methodology i.e. strictly through definitions that capture the notions (points, 
lines, signed areas, etc.) that are being dealt with and then prove that the various 
properties follow. 

To carry out this task, the hyperreal theories of Isabelle are extended with 
the notions of hyperreal vectors. In essence, this is an algebraic approach which 
develops geometric objects and relations between these objects in the Cartesian 
product IR*" of the field of hyperreals, where n = 3. We develop a theory of 
vectors in three dimensions, although we are mainly interested in plane properties 
since this has an algebra rich enough to capture the various notions we want to 
deal with. Thus, it also has more scope for future use. The hyperreals are chosen 
rather than the reals since we can then express infinitesimal geometric notions 
as well. The definitions that are used in the theories are given next. One theory 
introduces the algebraic operations on vectors while the other deals with the 
development of simple analytic geometry. 

3.2 Hyperreal Vector Space 

In general, the simplest definition for a real vector in n dimensions is as an n- 
tuple of real numbers, (ri, . . . , r^). However, a more geometric definition can be 
provided that suits our purpose well. 

Definition 3. Given two points P = (xi,yi,zi) and Q = (x 2 ,y 2 ,Z 2 ) in M*^, 
the vector Q — P is called the directed line segment from P to Q. The components 
of the directed line segment are the terms in the 3-tuple {x 2 ~ Xi,y 2 — yi, Z 2 ~ zi). 

In this definition, we implicitly assume that the origin is given by the hyperreal 
coordinates (0,0,0) and hence that a particular point is specified by the vector 



Nonstandard Geometric Proofs 



249 



whose components correspond to its Cartesian coordinates. In Isabelle, we for- 
mulate a theory of three-dimensional vectors by first introducing vectors as a 
new type corresponding to a triple of hyperreal numbers fl 

hypvec = {p :: (hypreal * (hypreal * hypreal)). True} 

Once a new type has been introduced successfully, Isabelle provides coercion 
functions — the abstraction and representation functions — that enable us to 
define basic operations on the new type. Thus, in this particular example, the 
functions 



Abs_hypvec :: (hypreal * hypreal * hypreal) hypvec 
Rep_hypvec :: hypvec => (hypreal * hypreal * hypreal) 

are added to the theory such that hypvec and {p. True} are isomorphic by 
Rep_hypvec and its inverse AbsTiypvec. On a more intuitive level, one may 
simply read Abs_hypvec as: 



AbsTiypvec (x,y,z) = 




in what follows. 

We can then define the various operations on the new type. For example, the 
inner product or dot product of two vectors P and Q is defined, using tuples as 
patterns in abstractions cni, byS 

P-Q = {X{{xi,yi,zi),{x2,y2,Z2)). 

X1X2 + yij/2 + ^1-22) 

(RepTiypvec P, RepJiypvec Q) 

This definition is slightly more complicated than the usual textbook one since 
it uses an explicit A-abstraction and the representation function. However, we 
prove theorems that capture the more familiar definitions and which can then 
be fed to Isabelle’s simplifier for rewriting. So for the dot product, we have the 
expected: 



Abs_hypvec {xi,yi,zi) • Abs_hypvec (x2,y2,Z2) = X1X2 + yiy2 + Z1Z2 



Similarly, we also define other important operations, such as cross product and 
scalar multiplication (-g). For clarity, we give their definitions as the simplifica- 
tion theorems proved in Isabelle rather than the actual definitions in terms of 
RepJiypvec and A-abstractions. The Isabelle definitions unfortunately tend to 
be slightly cluttered and become somewhat hard to read, especially in the case 

^ The Isabelle notation a::r denotes that a is of type r. 

^ In what follows, the multiplication sign (•) between hyperreal variables is omitted 
whenever no ambiguity is likely to result. 
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of the cross product. So, for cross and scalar products we prove the following 
rules: 

AbsJiypvec (xi, j/i, 2:1) x AbsJiypvec (X2, 2 / 2 , -^2) = 

Abs_hypvec {yiZ2 - Ziy2,ZiX2 - XiZ2,Xiy2 - 2/1X2) 



a -s Abs_hypvec (x, y, z) = Abs_hypvec (ax, ay, az) 

For any two vectors P and Q, the cross product can be viewed as defining 
the vector area of a parallelogram, with the vectors as two of the sides of the 
parallelogram and P x Q perpendicular to the plane containing P and Q. With 
this nice geometric interpretation in mind, the next step involves proving various 
properties of the cross product. These will be needed for our investigation and 
will also enable to capture useful notions such as the signed area. The following 
theorem, which shows that the cross product is not commutative, is thus proved: 

PxQ= i-Q) X P 

Geometrically, this means a change in the direction of the vector while its mag- 
nitude remains unaffected. The negation of a vector P, for its part, is defined by 
negating its various components. In Isabelle: 

—P= (A(xi, X2, X3). AbsJiypvec (— xi, — X2, — X3))(Rep_hypvec P) 

In the next section, the definition of signed area of a triangle follows directly 
from the geometric interpretation and algebraic behaviour associated with the 
cross product. 

Various other algebraic properties of the operations introduced so far are 
proved in Isabelle. A few straightforward ones that are useful to the development 
are as follows: 

• P-Q = Q-P 

• P-{Q + R) = P-Q + P-R 

• {a -s P) ■ {b -s Q) = ab {P ■ Q) 

• Px{Q + R)=PxQ + PxR 

• - (P X g) = (-P) X g 

• ~{PxQ) = Px (-g) 

• ~{PxQ) = QxP 

• (a -s P) X {b -s Q) = ab -s {P x Q) 

9 P X P = 0 

• P-{Px R) = 0 

• P X {Q X R) = {P ■ R) -s Q - {P ■ Q) -s R 

• {a -s P + b -s Q) X R = a -s {P X R) + b -s {Q X R) 

In these theorems, the zero vector is defined, as expected, by 



0 = Abs_hypvec ( 0 , 0 , 0 ) 
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Another important concept that has not yet been introduced is that of the 
length or norm of a vector. For a vector P, this is usually denoted by |P| and 
defined by taking the square root of the dot product P • P. In Isabelle: 

hvlen P = hsqrt (P • P) 

The square root operation over the hyperreals, denoted by hsqrt in Isabelle, is 
defined as the nonstandard extension of the square root operation (sqrt) over 
the reals. Details of these nonstandard concepts are given elsewhere m and are 
not especially important to the current exposition. It is sufficient for our purpose 
to regard taking the square root of a hyperreal as a well-defined operation with 
the usual properties. Other important theorems proved in the theory include^ 

— Cauchy-Schwarz inequality: abs {u ■ v) < hvlen u ■ hvlen v 

— Minkowski inequality: hvlen (u -I- u) < hvlen u + hvlen v 

After proving some further results of vector algebra, we develop a simple 
geometry theory based on the geometric interpretation of vectors and their op- 
erations. In the next sections, the definitions and results of the vector geometry 
development, as it currently stands, are outlined. 

3.3 Hyperreal Vector Geometry 

Chou, Gao, and Zhang have also used vector calculations in automated geometry 
theorem proving Pj . They assert a set of basic rules about the operations that 
can be carried out on vectors. Theorems are then derived using these basic 
axioms of the theory. The algorithm used by Chou et al. is nice and relatively 
simple: given a construction sequence for a geometric configuration, the points 
(i.e. vector variables) are eliminated one at a time from the vector expression 
standing for the conclusion, until only independent vector variables are left. The 
conclusion that results is then tested to see if it is identically zero. 

In contrast to the above approach, we proceed by means of definitions only 
and having introduced hyperreal vectors and defined the operations on them, 
there is enough algebraic power for the theories to express geometric concepts: 
orthogonality and parallelism, signed areas, congruence of angles, infinitesimal 
geometric notions and much more. Moreover, we proceed mostly through sim- 
plification and substitution steps that are be applied to both the conclusion and 
premises of the current goal. That is, the proof steps in Isabelle are not limited 
to point elimination only. 

We first introduce as basic geometric objects the notions of points and lines 
by defining the following types in Isabelle: 

pt = {p :: hypvec. True} 
line = {1 :: (pt * pt). True} 

From these definitions, a point is therefore specified by a position vector and a 
(directed) line given by a pair of vectors representing its end-points. We note that 

® In Isabelle, abs x denotes \x\. 
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it is possible for a line to have its two end-points the same — this is not a problem 
as we can still prove all the expected properties. However, with hindsight, we 
should probably have ruled out this type of degenerate lines as this would have 
removed side conditions from several of our theorems. 

Notwithstanding the last remark, these definitions do give the theory a sepa- 
rate, nicer geometric interpretation in which geometric objects (points and lines) 
are dealt with rather than vectors of hyperreal numbers. The abstraction and 
representation functions of Isabelle enable us to deal with the underlying vector 
theory to prove basic properties of parallelism, perpendicularity, collinearity etc. 
Once this is done, we can hope to work at a higher abstract level which deals 
with geometric relations and interact rather minimally with the underlying vec- 
tor constructions. This is similar in spirit with our construction of numbers, say 
the reals by Dedekind cuts, where initially for each operation we have to prove 
cut properties but as more theorems are proved, we deal less and less with the 
actual cuts and more with the algebra of the reals. 

However, in the subsequent exposition we shall regard position vectors and 
points as being interchangeable when giving the definitions and describing prop- 
erties proved. This abuse of notation is simply to make the definitions more 
readable on paper since it avoids the use of the coercion functions. We will show 
the definitions or theorems as actually formulated if the need ever arises. We 
also note that the notation A — B, used in Isabelle for a line from point A to 
point B, is syntactic sugar for Abs_line(H, 5). Therefore, for each geometric 
condition, we have the corresponding vector definition: 

1. That A, B, and C are collinear: 

coll CAB = {C- A)x {B-A) = 0 

2. That AB is parallel to CD: 

A — B\\C — D={B-A)x{D-C) = 0 

3. That AB is perpendicular to CD: 

A — B AC — D={B-A)-{D-C) = 0 

4. The length of a line AB: 

len (A — B) = hvlen (B — A) 

5. The signed vector area of triangle ABC: 

s.delta A B C = 1/2 -s {A - B) X {C - B) 

6. The angle between AB and CD: 

{A B, B C) = arcos (unitvec (A — B) ■ unitvec (C — B)) 



where 



unitvec P = (1/hvlen P) -g P 
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The definition of the angle relies on the theory of transcendental functions 
developed in Isabelle. In our work on the formalization of analysis, the var- 
ious trigonometric functions are defined over the reals through their power 
series expansions, and then extended to the hyperreals HU). 

With these definitions set up, we prove that the basic properties of signed areas 
actually hold and justify the statements of geometric relations that were made in 
terms of them. The rules about the sign of the area depending on the ordering 
of the vertices of the triangle are all proved without any problems since our 
definition makes them direct consequences of the algebraic properties of the 
cross product. Consider, for example: 

— s_delta c b a= —1/2 -g {c — b) x {a — b) 

= -1/2 -s (-(a - b)) X (c-b) 

= 1/2 -g (a — b) X {c — b) 

= s_delta a b c 

This and similar rules are proved with the help of Isabelle’s automatic tactic and 
added to the simplifier. The definition of parallelism in terms of signed areas is 
also easily verified: 

a — b \\ c — d (s_delta a b c = s_delta a b d) 

and the following theorem defines incidence (or collinearity) in terms of signed 
area: 

coll abc (s_delta a b c = 0) (1) 

We also extend the definition of incidence to that of a set of points incident 
on a line, thereby enabling us to prove some more theorems. We can deal with 
the ratios of oriented lines by proving theorems such as these: 

-A — B\\C — D: 



C^D 



len {A — B) {B - A) ■ {D - C) 



len (C — D) {D-C)-{D- C) 

— if i? is the foot of the perpendicular from point A to line PQ\ 

p p len (P — R) {A-P)-{Q- P) 
len {P Q) len {P — QY 

— if two non-parallel lines intersect at a point R\ 

len (P — R) -g {Q-P)x{V-U) = 
len {P-Q)-g{U-P) x{V-U) 

Some of the results above are high level lemmas stated by Chou et al. as being 
used in their automated GTP method based on vectors P). We verify all of 
them in Isabelle and store them as lemmas that become valuable when proving 
complicated geometry theorems. This verification of lemmas used in the area 
method is not a mere exercise as it consolidates the axiomatic geometry that we 
previously used in Isabelle. Moreover, since we are able to prove the expected 
geometric properties in the formalization, this gives us a relatively high degree 
of assurance that we are using the right definitions for various concepts. 
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3.4 Introducing the Infinitesimal Geometry 

We start by extending some of the definitions used for the hyperreals (see Section 
0 to their vectors. 

Definition 4. A hyperreal vector P is said to be infinitesimal, finite, or infinite 
if its length |P| is infinitesimal, finite or infinite respectively. Moreover, P is 
infinitely close to Q (P Q) if and only if Q — P is infinitesimal. 

With this definition formalized in Isabelle, the following equivalence theorem 
about infinitely close vectors is proved: 

Abs_hypvec (xi,yi,Zi) fHy AbsJiypvec (0:2,2/2,22) 

4 =^ Xi Ri 0:2 A 2/1 2/2 A 01 Ri 02 

In other words, two hyperreal vectors are infinitely close if and only if their 
components in corresponding positions are infinitely close to one another. This is 
a useful theorem that can be used in many cases to reduce infinitesimal reasoning 
involving vectors to similar reasoning over the hyperreals. We also prove the 
following theorems about the different types of vectors: 

1. P is infinitesimal if and only if all its components are infinitesimal. 

2. P is finite if and only if all its components are finite. 

3. P is infinite if and only if at least one of its components is infinite. 

and many other interesting theorems about the algebra of the operations and 
relations on them, such as|j 



[|a e M-,a^ 0|] => (a -sW a -g v) = {w v) 

U 0 => M • M ~ 0 
[|u 0; w S VFinitej] => u x w 0 

[|u Ri„ tc; w G VFinite|] => u x w tHy 0 
{x R!„ y) (hvlen {y - x) k. 0) 

X ~v y hvlen x ~ hvlen y 

[|a b;c € VFinitej] => o • c Ri 6 • c 

X ■ X G Infinitesimal 4=4> x G Vlnf initesimal 

[jx G Vlnf initesimal; y G VFinitej] => x ■ y G Infinitesimal 

u G VFinite — Vlnf initesimal => {{u x v r:„ 0) (3k. v 



( 2 ) 

( 3 ) 

( 4 ) 

( 5 ) 

( 6 ) 

( 7 ) 

(8) 
(9) 

( 10 ) 

k-yU)) ( 11 ) 



where Vlnf initesimal and VFinite denote the sets of infinitesimal and finite 
vectors respectively. Most of the theorems proved, we believe, have clear geomet- 
ric readings and formalize the intuitive behaviour one would expect. Theorem 
( 0 , for example, can be used directly to prove an intuitive theorem about a 
shrinking triangle in which one of the sides is infinitesimal. In Fig. 0 for ex- 
ample, one can intuitively see that as the length of be becomes smaller, the 

The Isabelle notation . . . ,4>n\\ => V’ can be read as if A ... /\ fn then tp. 
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a 




Fig. 1. A “shrinking” triangle 



lengths of ab and ac approach each other, until they are infinitely close when be 
is infinitesimal. This is captured by the following Isabelle theorem: 

len (6 c) ~ 0 len (a b) Ri len (a — c) 

Interestingly, if the lengths of the sides ab and be are real valued, then they have 
to be equal (i.e. triangle abc is an isosceles) when be is infinitesimal: 

[|len (a — b) S M' len (b — c) € IR; len (a — c) ~ 0|] 
len (a b) = len (6 — c) 

This is because of a theorem stating that two real numbers that are infinitely 
close to one another are effectively equal. We also formally derive, for example, 
theorems such as: 

[|len (a — b) G Finite; len (b — c) G Infinitesimal!] s_delta a b e 0 
and 

[jcoll abe; s_deltap b e Oj] => s_deltap a e s_deltap a b (12) 

The latter (see Fig. E) is proved using the cancellation theorem m, as well as 
various others involving associativity and commutativity of vector addition to 
perform AC-rewriting. These are just a few of the infinitesimal geometry theo- 
rems involving familiar geometric concepts. We next introduce a number of basic 
concepts systematically defined using the various notions from our nonstandard 
vector theory. 

4 Some Infinitesimal Geometric Notions 

Each of the new definitions can be viewed as weakening of the more familiar 
ones. We start with a nonstandard formulation of parallelism and orthogonality. 
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Fig. 2. Infinitely close areas 



Almost parallel and almost perpendicular 

Just as the concept of two lines being parallel was introduced, using hyperreal 
vectors the weaker notion of two lines being almost parallel is defined (with 
B and C ^ D): 

A — B ||a C — D = unitvec {B — A) KSy unitvec (D — (7) V 
unitvec (B — A) Riy —unitvec (D — C) 

We trivially prove that this is an equivalence relation. More importantly, the 
relation between this definition and that of parallel lines (see Section 15. 511 is 
highlighted by the following theorem, which is also proved in Isabelle 0 

[\D — C G VFinite — VInf initesimal; (13) 

B — A G VFinite — VInf initesimalj] 

^ A — B\\aC — D^{B-A)x{D-C)!HyO 

The theorem expresses the almost parallel property in a form similar to that 
of ordinary parallelism, with equality replaced by the infinitely close relation. 
However, there is a notable difference which is shown as an additional conditions 
on the two lines. Without the conditions, (m above is not a theorem as the cross 
product of an infinitesimal and infinite vector is not necessarily infinitely close to 
zero. Also, in terms of area, justifying a more geometrically intuitive definition 
based on signed areas, we have: 

[|len (C D) G Finite — Infinitesimal; 

len (A B) G Finite — Inf initesimalj] 

==^ A — B IIq C — D (s_delta a c d s_delta bed) 

We also define the notion of two lines being almost perpendicular. Once 
again, we make use of the notion of unit vector to get a suitable definition. Lines 

A — B _L(j C — D = unitvec (B — A) ■ unitvec (D — C) ~ 0 

® We wish to thank one of the referees for pointing out an omission in our initial 
statement of the theorem. 
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We note that since the dot product produces a hyperreal, we use the infinitely 
close relation Ri over these numbers rather than which is defined over hyper- 
real vectors. 

Almost collinear 



P 




Fig. 3. Infinitely close areas 



We next introduce the notion of three points being almost collinear. Intu- 
itively, one might expect three points a, b, and c to be almost collinear (denoted 
by acoll a 6c in Isabelle) if and only if the signed area s_delta a b c is infinitely 
close to zero. Such a definition would be very similar in spirit to the equivalence 
theorem (^. However, since our geometry allows both infinitesimal and infinite 
quantities, this definition is inadequate: it does not hold in the case where two of 
the points concerned, say 6 and c, are infinitely far apart and the third one, say a, 
is infinitely close to the line be. This is because the cross product {c—b) x (a — b) 
is not necessarily infinitely close to zero in this case as well. Instead, we define 
the property as follows: 



acoll abc = {b — a) ||q (6 — c) 

and prove a number of theorems involving it such as the variant of dnj, shown 
in Fig. 0 

[jacoll abc] s_deltap 6 c 0|] s_deltap a c ^y s_deltap a b 
Infinitesimal angles 

Our NS A theory is powerful enough to prove theorems involving the trigono- 
metric functions and infinitesimal angles. For example, we can formally formulate 
and prove assertions such as 

sin(0) = 0 and cos(0) = 1 where 9 is infinitely small 

that one often sees in textbooks. These are rarely given any further justification: 
the reader needs to rely on her knowledge of trigonometric functions and on 
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her intuition about what infinitely small means to see that the statements are 
indeed plausible. Such assertions can be formalized in NSA, however, by making 
9 an infinitesimal and replacing equality by the infinitely close relation Ri. The 
proofs are intuitive, yet rigorous, and relatively easy to mechanize. We give, as 
an example, a brief proof of the statement sin(6*) « 9. 

In the NSA theory jEj of Isabelle/HOL, the formal nonstandard definition 
of the derivative of a function f at x (DERIV) is given by: 

DERIV(a:) f :> d = \/h G Infinitesimal — {0}. — /(^ ^ ^ 

This is simply saying that the derivative of / at a; is d if ^ is infinitely close 
to d. With this, and assuming the standard results (proved in Isabelle) that 

cos(O) = l,sin(0) = 0, 



and 

DERIV(a;) (Ax. sin(x)) :> cos(x), 
we can easily prove that cos(0) ~ 9 for all infinitesimal 9. 

Proof: 

if d = 0: This is trivial since ~ is reflexive. 

else if 0 0: Since DERIV(x) (Ax. sin(x)) :> cos (x), for all x, we have that 

DERIV(O) (Ax. sin(x)) :> cos(O) 

=> V/i G Infinitesimal — {0}. (sin(0 + h) — sin(0))/h « 1 
=> (sin(0 + 9) — sin(O))/0 ~ 1 
=> sin(0) /9 ml 
=> sin(0) m 9 

One important point to note is that we made use of the following theorem to 
reach the final step: 

a m b c € Finite 
a ■ c m b ■ c 

where Finite is the set of finite numbers and Infinitesimal C Finite H3|. In a similar 
fashion, we also prove that cos(0) ~ 1 and, interestingly, that tan(7r/2 + 9) G 
Infinite, for all infinitesimal 9. We expect such results involving angles and 
trigonometry will to prove useful in the further development of the geometry. 

In addition, we also prove that the angle between two lines which are almost 
perpendicular is infinitely close to 7 t/ 2, i.e.. 



b, c — d) m 7t/2 
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Almost similar triangles 

This is basically the notion of ultimately similar triangles that we have de- 
scribed and used a number of times before HUE!. We briefly recall its definition 
here: 



USIM ah c a' b' c' = {h — a, a — c) ~ {h' — a' , a' — c') A 

(a — c, c — b) ~ {a' — c' , d — b') A 

(c — b, b — a) ~ {o' — b' , b' — a') 

We are still formally investigating the properties of this concept. We have al- 

ready reproduced in our new setting most of the theorems described in previous 
work m- Similarly, we have defined the notion of two triangles being almost 
congruent . 



5 Nonstandard Proofs of Standard Geometry Theorems 

Our nonstandard techniques are strong enough to produce nice proofs of tradi- 
tional geometry theorems. The proofs can be viewed as moving into the hyperreal 
space, just as it is possible to move into complex space when dealing with proofs 
in analytic geometry. In what follows, we illustrate our nonstandard methods by 
considering infinite polygonal approximations of the circle. 

5.1 Polygonal Area Approximation 

We first consider a nonstandard proof that the area of a circle of radius r is 7rr^. 
The area of the circle will be shown to be infinitely close, hence equal, to the 
area of infinitely many enclosed (inscribed) polygons. 

In Fig. E] the area of the closed polygon P = Ai . . . A„ is defined by the 
formula: 



area Ai . . . Aji = OA 1 A 2 -t- OA2A3 -t- . . . -t- OAji—iA^ -t- OA^Ai 

where OA 1 A 2 , for example, represents the area of triangle OA 1 A 2 (which we 
re-define below in terms of the vector outer-product in the plane) . The value of 
the polygonal area is independent of O but depends on OA,, the radius vector 
to the fth point. The definition of polygonal area looks recursive except for the 
last area term (OA„Ai). In Isabelle, this motivates the following definition for 
the area of the polygon with the zero vector as O (and the polygonal points 
numbered from 0 rather than 1): 

polyArea P n r = pArea P n r + area 0 {P nr) (P 0 r) 
where pArea, the area of the open polygon, is inductively defined as 
pArea P 0 r = 0 

pArea P n r = pArea P (n — 1) r + area 0 (P (n — 1) r) (P n r) 
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Fig. 4. A closed polygon 



and 



area ab c= l/2[(6 — a)(c — a)] 

[AbsJiypvec (cci, j/i)Abs_hypvec {x 2 ,y 2 )] = Xiy 2 - yiX 2 (14) 



We note that a parameter r, which at first sight might appear superfluous, is 
included in the definitions of both polyArea and pArea. This is because often 
the radius vectors OAi does not depend on just i but also on some other quan- 
tity such as an angle. The two parameters (e.g. multiplied) together enable us 
to progress along the curve being approximated. In an initial formalization, we 
omitted the extra parameter but then found that we could not adequately rep- 
resent the inscribed polygon. This lead to the revised definition presented in this 
section. 

Now, if C is a circle of radius 1, we can inscribe a polygon Ai ■ ■ ■ An by 
choosing points A\, A 2 ,..., An in order along it. If n is an infinite hypernatural 
number then the points Ai crowd one another, and we expect to arrive at the 
formula for the area enclosed by C. 

In our mechanized proof, we first consider the unit semi-circle ABC (see Fig.EJ). 
Using the angle 9 between successive radius vectors as parameter, the polygon 
can be defined by the following sequence: 



where k denotes the fc-th point of the polygon. Hence, given that n points are 
inscribed in the semi-circle, the angle between the radius vectors is 7r/n and so 
the polygonal area is: 



We then easily prove by induction and with the help of the mechanized lemma: 



Xk 9. Abs_hypvec (cos k9, sin k9) 



(15) 



polyArea (Xk9. AbsJiypvec (cos fc0, sin fc0)) n (tt/u) 



sin {x — y) = cos y sin x — sin y cos x 



(16) 
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supplied to Isabelle’s simplifier that the following theorem holds: 

polyArea (Afc 0. Abs Jiypvec (cos sin fed)) n (tt/ti) = l/2nsin(7r/n) (17) 
We use the fact that: 

area 0 (c -s x) (c -s y) = ■ area 0 x y 

to prove by induction the following property of polygonal areas: 

polyArea {Xnr. c-sPnr)NR = (? - polyArea P N R (18) 

which means that for a semi-circle of radius r, we have: 

polyArea (A/c 0. Abs Jiypvec {r cos k9 , r sin k9)) n (Tr/n) = l/2r^nsin(7r/n) 

(19) 

Given that n, the number of inscribed points, is an infinite hypernatural number, 
we have that 7r/n is infinitesimal. But, from the result in the previous section 
about infinitesimal angles, we also know that 

sin(7r/n) 

(7r/n) 

and hence that 

nsin(7r/n) ~ tt (20) 

This result, with above, allows us to prove that: 

polyArea {Xk9. Abs_hypvec {r cos k9 , r sin k9)) n (tt/u) ~ l/27rr^ 

from which we deduce that for a circle, with the angle between successive radius 
vectors given by 27r/n, the following holds: 

polyArea (Afc 0. Abs Jiypvec {r cos k9 , r sin k9)) n (^T^jn) ~ 7rr^ (21) 
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Hence, by “exhausting” the circle with a inscribed polygon of infinite number of 
sides, we have formalized a simple and relatively intuitive proof that the area of 
the circle of radius r is infinitely close to 7rr^. If we assume that the area of the 
circle is a real quantity then we can deduce that it is equal to 7rr^, as one would 
expect. 



5.2 Polygonal Length Approximation 




Fig. 6. Approximating the length of a curve using an inscribed polygon 



Our technique for determining areas is easily adapted to determining the 
length of a curvilinear arc. Geometrically, the length of a polygon in- 
scribed along some arc is given by Ym=i (see Fig. El- In Isabelle, this 

is a direct recursive definition (with the points numbered from 0 rather than 1): 

pLength P 0 r = 0 

pLength P n r = pLength P (n — 1) r + hvlen {{Pnr) — {P {n — 1) r)) 

This definition is further refined to the case when we are dealing with a closed 
curve such as a circle. We then need to close the polygon by adding the length 
of the vector (segment) from the last to the first point of the figure. In Isabelle, 

polyLength P n r = ifn=l then hvlen ((P 1 r) — (PO r)) 

else pLength P n r + hvlen {{Pnr) — (POr)) 

The definition is conditional to prevent the length of a degenerate closed polygon 
with only two points Aq and A\ (i.e. a line) from being defined as |AoAi| -|- 
|AiAo|. With these concepts defined, we prove a theorem about polygonal length 
analogous to theorem m about area: 

polyLength (Anr. c -s P n r) N R = abs c • polyLength PNR 

This property holds because of the following theorem about lengths of vectors: 

hvlen {c -s X — c -g y) = abs c • hvlen {x — y) 
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We continue with our case study involving the circle and outline how to 
approximate its circumference using an infinite polygonal approximation. The 
mechanization follows a similar approach to that of the previous section and uses 
the parametric definition for the inscribed polygon. This time we consider n 
points being inscribed in a circle of radius r, which means that the angle between 
each radius vector is 27r/n. The polygonal length is now given by the theorem: 

polyLength [XkO. AbsJiypvec (rcosfc0,rsinfc0)) n {2'nj'n) (22) 

= abs r ■ polyLength (XkO. Abs_hypvec (cos sin fc0)) n 
= abs r • nsj2 - 2 cos {2ti I n) 

proved by induction on n followed by simplification. Using the following lemmas 
(all proved in Isabelle) as rewrite rules: 

cos 2x = cos^ X — sin^ x 
cos^ a: = 1 — sin^ x 
2 — 2 cos 2a; = 4 sin^ x 
Vx^ = abs X 



theorem J22I) automatically simplifies to 

polyLength {XkO. Abs_hypvec {r cos kO , r sin kO)) n (27r/n) (23) 

= abs r ■ n ■ abs (2 sin(7r/n)) 

With this result set up, we are almost done, since using theorem , we now 
have that: 

n ■ abs {2sin{'K / n)) = abs (2nsin(7r/n)) ~ abs (2n(7r/n)) = 2tt 
and hence that: 

polyLength {XkO. Abs_hypvec {r cos kO , r sin kO)) n (27r/n) ~ 27 t • abs r (24) 

Once again, our infinite approximation has provided a reasonably intuitive proof 
of a familiar geometric result. All our proofs proceed with a relatively high degree 
of automation since much of the simplification work can be done automatically 
by Isabelle’s rewriter. 

Our definitions for polygonal area and polygonal length are generic and can 
thus be used to approximate the areas and lengths of other figures provided these 
can be defined formally. This also means that the techniques described in this 
section are general ones that, we believe, provide new methods for mechanical 
theorem proving in geometry. 

5.3 Brief Remarks on the Mechanization 

We now make a few further remarks on the proof formalization just considered. In 
particular, we give some indication of the amount of work involved in completing 
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some of these proofs, and discuss how the proof process might be (completely) 
automated. 

As we have already mentioned, Isabelle can provide a relatively high degree 
of proof automation in many cases (e.g. through the use of built-in decision pro- 
cedures and automatic tactics). However, since the system is a proof assistant 
rather than an automatic theorem prover (like Otter m, say) the user is ex- 
pected to play an active role in the proof-finding process. Indeed, during proofs 
the user often interacts with the system and guides it by indicating which tactic 
and rules to apply at each step. 

The proof of lemma o, for example, requires 5 steps and takes less than 
half a second. Four of these steps are rewrites carried out by supplying rules such 
as (EJ and Ij1 6I> to Isabelle’s automatic tactic auto_tac. The only different step 
is a case-split: this requires an explicit intervention in which we consider, using 
Isabelle’s case_tac tactic, the cases n = 0 and n yf 0 as two separate subgoals. 
As mentioned in Section 16. 2L theorem JZO) then easily follows from this lemma 
and result (f21)ll . This last proof is automatic: it only requires one application of 
auto_tac with these theorems supplied as rewrite rules and takes negligible time 
to complete. 

Another important aspect of many of our proofs, which directly affects their 
possible automation, is the need for mathematical induction. As our definitions 
for the area and boundary of a polygon are both recursive, the use of induction 
in proving their properties is to be expected. Thus, in the case of theorem (I I !SII . 
for instance, we have to explicitly specify on which variable Isabelle is to perform 
the induction {N in that case). More importantly, the use of an inductive theory 
for expressing our geometric notions means that, in general, we cannot hope for 
full automation (due to results like Godel’s first incompleteness theorem). 

However, these negative aspects do not mean that we have to give up on 
trying to automate these proofs altogether. In fact, there are has been extensive 
work on the automation of inductive proofs in the past by Boyer and Moore |2|, 
Bundy |S|, Kapur and many others. The proof-planning approach of Bundy 
is especially relevant to us, as we are currently involved in integrating it with 
Isabelle. This successful Al-style planning technique which guides inductive (and 
other types of) proofs through the use of powerful heuristics should, we hope, 
provide a clear path to the automation of our own geometric proofs. 



6 Further Work 

As mentioned already, this paper describes work currently in progress. We still 
have much of the geometry to explore. One currently unproved conjecture, for 
example, is that two (co-planar) lines which are almost parallel do meet at a 
point infinitely far away i.e., we expect to have a well-defined, non-degenerate 
solution to the problem. 

We now have a relatively well developed vector theory. This contains many 
of the familiar theorems about vector operations as well as the new theorems 
involving the infinitely close relation, infinitesimal and infinite vectors, and other 



Nonstandard Geometric Proofs 



265 



nonstandard notions. As the work proceeds, we expect to add more theorems to 
provide a theory that can be useful for other purposes (e.g. proofs in mechanics 
that often involve vectors and infinitesimals). 

We will be introducing and investigating other, perhaps less obvious, almost 
relations. For example, we have recently mechanized notions of approximate 
geometric objects. Using this notion, an ellipse with infinitely close foci can be 
regarded as being almost (but not quite) a circle. Other notions include “almost 
betweenness”, approximate point inclusion in a triangle, and “almost a tangent” 
to a circle, for example. 

Another aim will be to mechanize more geometric proofs that use infinitesi- 
mal and infinite quantities to reach infinitely accurate approximation results. We 
have formalized the useful notion of a polygon with an infinite number of sides 
which can be used to approximate any closed figure (curve). We have shown, 
in details, how this can be used to derive simple and intuitive proofs about the 
area and circumference of the circle. We will mechanize other proofs that use 
Archimedes so-called “Method of Exhaustion” in which one figure is approxi- 
mated more and more accurately by another one in order to compute geometric 
quantities such as areas and volumes. We believe that these are proofs not cur- 
rently captured by existing mechanical theorem proving methods. The work of 
Baron [Q, for example, provides a wealth of such proofs throughout the centuries 
for us to work with and mechanize. 

Finally, as mentioned in the previous section, we hope to make automation an 
important feature of our approach by using proof-planning, rather that human 
intervention, to guide Isabelle in finding geometric proofs of conjectures. This 
goal should create a nice link between our geometry work and some of our other 
interests. 

7 Concluding Remarks 

In this paper, we have formally introduced the notion of an infinitesimal geome- 
try based on hyperreal vectors. Various theorems have been proved that have no 
direct counterparts in Euclidean geometry since the latter only deals with real 
numbers. 

Vector algebra offers an attractive approach to mechanical geometry theorem 
proving. There is much active research going on using the related field of Clifford 
algebra, which is generally regarded as being more expressive |2 lft)j . In our case, 
since we are doing interactive rather than automatic theorem proving, vectors 
provide a simple and adequate approach to analytic geometry. Also, as was shown 
by Dieudonne, inner (dot) and cross products of vectors are sufficient to develop 
elementary geometry |H|. 

As far as we are aware, this is the first mechanization of a theory of hyper- 
real vectors. Moreover, Keisler’s textbook is, to our knowledge, the only work to 
give a brief exposition of a vector theory H3- As a result, most of the theorems 
mechanized in Isabelle have been proved independently of any previous work 
or textbooks. We have shown that these vectors obey the usual algebraic rules 
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for vectors since they form an inner product space over the field IR* . By using 
the extended vectors instead of real vectors, it is possible to describe, in addi- 
tion to ordinary geometric concepts, the novel notions of infinitesimal geometry 
presented in this paper. 

The analytic geometry development was carried out to provide a rigorous 
definitional approach in which to investigate our infinitesimal geometry. By fol- 
lowing the HOL methodology, we have the guarantee that our formalization is 
consistent and that all results proved are actual theorems about the geometry we 
have developed. In addition, it provides support for our previous work by giving 
rigorous proofs of many of the basic rules from the GTP methods of Chou et al. 
that were previously asserted as axioms in Isabelle. 

As a final note, we remark on an important realisation emphasized by the 
current work: the inclusion of infinitesimals and other nonstandard concepts in 
geometry introduces subtle issues that can easily lead to inadequate definitions. 
Indeed, it can be problematic to formulate concepts that rely on some form 
of product (cross, dot, multiplication etc.) as the operation can be ill-defined 
whenever it involves both an infinitesimal and an infinite quantity. We became 
especially aware of the subtlety involved when our initial definition for almost 
parallel lines (we used the equivalence theorem (nn)i without the associated con- 
ditions) proved inadequate. We could not prove some of the properties we felt 
should hold since we were implicitly ruling out an infinitesimal line and an infi- 
nite line being almost parallel. 

The realisation came after some experimentation with the framework and did 
force us to exercise much more care. However, the fact that we encountered such 
a problem is probably unsurprising. After all, the flaw that we found in one of the 
famous proofs of the great Newton was also of this nature US; it involved taking 
the ill-defined product of an infinitesimal and an infinite quantity. However, this 
is a useful experience that will help us as we explore more challenging concepts 
in this geometry. 
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Abstract. The underlying principles and main original techniques used 
in a running generic logic-based theorem prover are presented. The sys- 
tem (a prototype) is called HOARD atinp (Human Oriented Automated 
Reasoning on your Desk) and has been specialized in this work to proof 
learning through geometry. It is based on a new calculus, particularly 
suited to the class of problems we deal with. The calculus allows treat- 
ment of equality and automatic model building. HOARD atinp has some 
other original characteristics such as proving by analogy (using matching 
techniques), some possibilities of discovering lemmata (using diagrams), 
handling standard theories in geometry such as commutativity and sym- 
metry (by encoding them in the unification algorithm used by the calcu- 
lus) , and proof verification in a rather large sense (by using capabilities 
of the calculus). 

As this work is intended to set theoretical bases of a new logic-based 
approach to geometry theorem proving, a comparison of features of 
our system with respect to those of other important, representative 
logic-based systems is given. Some running examples give a good taste 
of the HOARD atinp capabilities. One of these examples allows us to 
compare qualitatively our approach with that of a powerful prover 
described in a recent paper |S|. Some directions of future research are 
mentioned. 

Keywords. Automated geometric reasoning, analogy, model (counter- 
example) building, proof structuring with diagrams, computer assisted 
learning. 



1 Introduction 

For many years we have worked with the aim of improving general theorem 
provers from a qualitative point of view in the framework of a project called 
ATINF (French acronym for “ATelier dTNFerence”, meaning “Inference Labora- 
tory”; see pm for example) . We oppose qualitative to (only) “fast and cleverly” . 
More precisely, we would like general theorem provers not only to prove theo- 
rems, but also to help users to analyze and present proofs, to find analogies with 
other proofs, to build counter-examples (models), etc. 



J. Richter-Gebert and D. Wang (Eds.): ADG 2000, LNAI 2061, pp. 268-|^2^ 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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Of course our aims are not original and since the very beginning of automated 
theorem proving some pioneer work pursued similar goals (see e.g. m and 
the work of Bledsoe’s group already in the seventies). This early nice work is 
characterized by ad hoc approaches to particular classes of problems. We contend 
that a general approach towards a formalization of powerful human techniques 
of reasoning is possible (and valuable). This generality is not contradictory with 
a specialization a posteriori to particular classes of problems. 

The present work is an application of our views to a particular domain: learn- 
ing of proof through geometry. To reach this goal we specialize some techniques 
presented elsewhere (see for example mmm) and well-known such as E- 
unification (i.e. unification modulo equational theories) or hyper-resolution and 
paramodulation. The latter are the two components of a new calculus that is at 
the very basis of the system. We also use diagrams in a more human-oriented 
way, i.e. not only to prune the search space by filtering but also in order to 
suggest lemmata allowing to plan and to structure proofs (the use of diagrams 
not only as a filter but also as a guide is suggested too in nm and in m but in 
a much more informal and embryonic way) . 

Albeit some of our proposals are only partially developed, these are not only 
theoretical propositions or general ideas about the subject: a running prototype 
based on this approach is presented in detail and some running examples are 
shown. 

As it is well known, there exist some extremely efficient, powerful systems for 
geometrical theorem proving (see e.g. IS]). Such systems are based on powerful 
algebraic methods such as Wu’s. The basic principle is to translate the consid- 
ered problem into an algebraic language, and then solve the considered set of 
equations, using specialized algorithms (for example the “characteristic set” or 
Grobner basis method, see e.g. (3). The powerful methods of Tarski, Collins 
and Wu have on one side limited range of applications (see e.g. m) and on the 
other side — and much more important for our present purpose — they do not 
provide readable proofs. 

There exist a few systems based on purely logical approaches to geometri- 
cal theorem proving (see e.g. [1 oy2S] l and the most powerful (general-purpose) 
existing theorem provers such as OTTER, SPASS, and SETHEO are not suc- 
cessful when trying to prove geometry theorems. It is commonly admitted that 
algebraic approaches are the most powerful and efficient but there are several 
strong reasons to justify further investigation of logical approaches (see e.g. |B|). 

Our main goal is not to program a new logic-based geometry theorem prover, 
but to show that it is possible to incorporate to the provers formalizations of very 
useful human techniques that increase their power and make their interaction 
with users much more natural. 

Learning (teaching) of the notion of proof through geometry seems to us both 
a challenge and a way to show evidence of the usefulness of our approach. 

This work presents part of the automated reasoning module of a project 
on computer assisted learning named Baghera, under development at the 
LEIBNIZ-IMAG laboratory. One of the main aims of Baghera is to fill a gap 
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in existing tutorial systems (see e.g. P): these systems neither perform really 
automated reasoning (i.e. proof verification, proposition of alternative proofs, 

. . . ) nor offer techniques for the analysis or presentation of proofs. Consequently, 
we have tried to incorporate these features in our theorem prover HOARDatinp 
(Human Oriented Automated Reasoning on your Desk). For reasons both his- 
torical and of closeness we started from a tutorial systems called CABRI-Euclide 
(see I20I21I . and references therein) and in a similar way we did with our sys- 
tem GLEFatinp (see Pj), and we fixed a list of requirements to reach (“ideal” 
specification) . 

The system should be able to verify proofs (and to complete them if necessary: 
human beings never write fully formalized proofs); to build counter-examples 
(models)] to verify and to suggest analogies between proofs and/or formulas; to 
do abduction — i.e. to identify (supplementary) hypothesis that could allow to 
prove (or to “explain”) a given assertion; to perform some form of proof planning, 
suggesting a guideline for the stuck student. 

Obviously the system should be efficient enough to react to requests in “rea- 
sonably” short time. It should also be generic because it is intended to be used 
by students of very different levels of knowledge. Therefore the underlying geo- 
metric theory can change. The system must be able to adapt to these changes 
(i.e. to deal with different theories), in order to construct proofs understandable 
by the user 0 Therefore, no general axiomatization (such as Hilbert’s or Tarski’s) 
can be used. This need of genericity prevents us from using very specialized and 
specific approaches such as expert systems, and makes the choice of a logical 
approach very natural. 



Organization of the Paper 

The rest of the paper is organized as follows: In Section El we describe the 
fragment of first-order logic used to encode geometric problems submitted to 
HOARDatinp, and we present the proof calculus that is used to obtain proofs. 
This calculus is a refinement of the ordered hyper-resolution -|- ordered posi- 
tive paramodulation rules and is particularly well adapted to human oriented 
proof presentation (see Annex A). We particularly emphasize its specificity w.r.t. 
existing similar calculi. 

In Section 0 we describe the additional features that have been added into 
the system, such as proof analysis, proof generalization, analogy detection, and 
counter-example (model) construction. 

Section 0 particularly emphasizes one important semantic aspect of the sys- 
tem: the ability to use diagrams for guiding and structuring proofs, in particular 
to generate lemmata. 

^ Of course, another solution would have been to compute the proof in some fixed 
axiomatization of geometry, and then to translate it into the desired axiomatization. 
But designing such a translation algorithm would be a difficult — and possibly 
infeasible — task. 
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Sections 3 and 4 are those that will be studied more extensively in our future 
research. Nevertheless we consider the already obtained results as very promising, 
as hopefully shown by the examples. 

Section Ogives a short conclusion and main lines of future work. 



2 The Theorem Prover HOARDatinf^ 

Principles and Main Featnres 

In this section, we describe the theoretical bases of HOARD atinp- The prover 
is implemented in Prolog (for the sake of fast prototyping and portability). 
HOARD atinp is still a prototype and a lot of implementational work remains 
to be done before putting it in the public domain (see Section 5). 

Remark 1. It is neither possible nor useful to recall here all the standard notions 
of automated reasoning used in this work. The interested reader can consult 

Eau- 

2.1 A Restricted First-Order Language 

HOARD atinp uses a subclass of first-order logic. We recall below some basic 
definitions concerning the language. 

Definition 1. Let E be a set of functional symbols, let X be a set of variables. 
Let arity be a function mapping each element of E into an element o/IN. The 
set of terms T{E, X) is the least set that satisfies the following properties: 

- X C T(E,X); 

- Lf f € E, arity(f) =0 (i.e. the constants) then f € T(E,X); 

- Lf f e E,arity{f) = n,n > 0, (ti, . . . , G T{E,X)^ then /(ti,...,t„) G 
T{E,X). 

The set of terms containing no variables are called ground and noted T{E) 

Definition 2. An atom is of the form t = s, where t and s are two terms. 
An atom is said to be non-equational iff it is of the form t = true or true = 
s, equational otherwise (i.e. if it is of the form t = s where t and s are not 
syntactically identical to true). 



Remark 2. In this paper, we only consider atoms of the form t = s. No predicate 
symbols other can “=” are allowed. It is well known that this restriction does not 
entail any loss of generality, because any atom of the form P(ti, . . . ,tn) where 
P is a predicate symbol may be replaced by an equational atom of the form 
p(fi, . . . , tn) = true, where true is a special term and p is a function symbol of 
arity n. 
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Definition 3. A literal is either an atom (positive literal) or the negation of 
an atom (negative literal^. A literal is said to be equational iff the corresponding 
atom is equational, non- equational otherwise. 

A clause is a finite set of literals (interpreted as a disjunction). A clause of 
the form {-<Li, . . . , -iL„, L(, . . . , L'^} where Li, . . . , Ln, L{, . . . , are atoms is 
often denoted (following the usual sequent-like notation) as a rule: LiA. . .AL„ —>■ 
A V ... V L'^. 

We also require that each of the clauses be range-restricted i.e. all the vari- 
ables occurring in the positive or equational literal of a clause C must also occur 
in a negative non-equational literal in C (this is an generalization of the standard 
notion of range-restricted clauses) . 

Range-restricted clauses are expressive enough to state all the geometric ax- 
ioms and theorems that we have to deal with in practice. Moreover it is well 
known that any set of clauses can be transformed into an equivalent set of 
range-restricted clauses (provided some new predicate symbols — the so called 
domain predicates — are added to the signature) . 

More formally: 

Notation. Let C be a clause. We denote by Var{C) the set of variables occurring 
in C. 

Definition 4. Let C he a clause. We denote by C~ (resp. C~^) the set of negative 
(resp. positive) literals occurring in C . We denote by the set of equational 
literals in C and by the set of non-equational literals in C. C~^ 

and have the obvious meaning. 

A clause C is said to he positive (resp. negative^ iffC~^ = C (resp. C~ = C). 

A clause C is said to he range-restricted iff Var{C'^) C Var{C~) and if 
Var{C~^) C Var{C~^^). 

As we shall see, the restriction to range-restricted clauses allows to simplify 
the proof process especially in conjunction with the use of (positive) hyper- 
resolution. It is interesting to remark, for example, that any positive range- 
restricted clause must be ground (indeed, if C is positive, then C~ is empty hence 
Vor(C'“) = 0 therefore we must have Vor(C'+) = 0, i.e. Var{C) = 0). Moreover, 
since positive hyper-resolution only produces positive clauses, it implies that 
only ground clauses will be generated by the application of the hyper-resolution 
rule. 

2.2 The Calculus 

In this section, we describe the calculus used by HOARD ^tinp- It is based on 
forward chaining (bottom up reasoning), using a variant of the hyper-resolution 
-|- positive ordered paramodulation rules. 

We introduce a new inference rule, called if-hyper-resolution, that combines 
hyper-resolution with equality reasoning and rewriting techniques, if-hyper- 
resolution can be seen as a macro inference rule combining in a single inference 
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step, several applications of the positive ordered paramodulation and positive 
resolution rules (as defined for example in 0). This rule has been designed in 
order to deal with range-restricted problems, especially if the equational part is 
not very important, relatively to the non-equational part (which is the case for 
many of the problem we have to treat). 

Let us first recall the definition of the (well-known) ordered paramodulation 
rule. As usual L[t]p denotes a literal obtained from L by replacing the term at 
position p by t. We assume an order (noted <) defined on (ground) terms and 
(ground) literals. 

Positive Ordered (PO-) Paramodulation: 

{t = s}UR {L[t]p}L\R' 

{L[s]p} U RU R' 

where R, R', L[t]p are positive, t = s > R, L[t]p > R' and s <t. 

Remark 3. Notice that unlike the standard definitions (see for example | 32 |)) 
this rule is applied only on positive clauses, hence (since the clauses are range- 
restricted, see Definition 4) only on ground clauses. Therefore, no unification is 
needed as in the general definition of paramodulation. 

In order to define the if-hyper-resolution rule, we must first define the fol- 
lowing EC -unification procedure. It performs a (restricted) form of (conditional) 
if-unification. Given a set of clauses S and two terms t, s, it tries to compute a 
clause C and a substitution 9 such that S |= (-iC tO = s). -iC can be seen 
as a condition which is sufficient to prove that S \= t0 = s. Not all solutions 
are computed (this would be obviously not possible in general) but the given set 
of solutions will be still sufficient to ensure the refutational completeness of the 
whole calculus. The EC -unification procedure is given in Figure 1. 

Remark f. The tests ^CE,C is unsatisfiable” and ^CE,-^R,t = /(si, . . . ,Sn),C 
is satisfiable” are propositional satisfaisability tests. 

Since the EC -unification procedure is nondeterministic, we must specify how 
this nondeterminism is handled. Actually, the choice of the pair (t, s) and that 
of the application of t',t",R can be done arbitrarily, i.e. using a “don’t care” 
nondeterminism. No backtracking is needed. Some heuristic are used to prune 
the search space (for example, equations with the smallest number of variables, 
or equations that are not likely to be rewrited, are chosen first). In contrast, 
the choice in line 25 must be done using a “don’t know” nondeterminism, which 
requires backtracking and will therefore provide a set of solutions rather than a 
unique solution. 

Remark that the instruction in line 34 corresponds exactly to the usual de- 
composition rule of the unification algorithm, i.e.: 

n 

f{ti,...,tn) = f(t[,...,0 ^ f\U=t[ 

2 = 1 
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1 Procedure EC -unification 

2 INPUT : 

3 A set of clauses S 

4 A set of equations S' 

5 OUTPUT : 

6 A substitution 6 of VaHt) and a positive ground clause E 

7 or a flag “no solution” . 

8 Begin 

9 P--S' 

10 61 — 0 

11 E~± 

12 C — 0 % C is a set of conditions, initially empty 

13 % Remark: the purpose of C is only to reduce search space by avoiding 

14 % irrelevant computations 

15 While P / 0 

16 If ->E,C is unsatisfiable Then Return(“no solution”) 

17 Choose (t = s) G P 

18 If i is a variable Then P := P \ {t = s),0 := 9 U {t ^ s}, P := P9,C := C9 

19 % Replacement rule 

20 Else If s is a variable Then P: = P\(t = s),6 := 6u{s^t}, P : = P9, C:=C9 

21 % Replacement rule 

22 Else % t is of the form /(ti, . . . , t„) 

23 If 3/(si , . . . ,Sn)=t"uRGS such that /(si, . . . ,tn)=t" > R, /(si, • • • ,Sn)>t" 

24 and -iP, -iP, t = /(si, . . . , s„),C is satisfiable 

25 Then % Conditional narrowing rule 

26 Or Begin 

27 P~P\{t = s)U {t" = «} U Ur=i{ii = Si} 

28 E--EVR 

29 C :=CU {f{si,...,Sn) =t} 

30 End 

31 Or 

32 C :=CU{PV Vr=iSi 

33 Else If t = f{ti, . . . , tn) and s = . . . , t'„) 

34 Then P := P \ {t = s) U {ti = \ i G [l--n]} % Decomposition rule 

35 Else If t = f(ti, . . . , tn) and s — g{t '\, . . . , t'm) and f g 

36 Then Return) “no solution”) % Clash rule 

37 EndWhile 

38 Return)^, E) 

39 End 



Fig. 1. The EC -unification procedure 
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whereas the instruction in lines 18-20 and 35 corresponds respectively to the 
replacement and clash rules. 

Finally, the instructions in lines 23-30 correspond to an application of a 
conditional narrowing rule, i.e. a term t is transformed into a term t" if there 
exists a clause t' = t" y R and a substitution 6 such that t9 = t' and provided 
that the condition R is added to the resolvent. 

Now, we have all what we need to define the if-hyper-resolution rule. 

Vi=l = Sj — >■ C S 
C9\JE 

where (0, E) := EC -unification{S , {ti = xi, si = xi, . . . , = a;„, s„ = a;„}). 

The following example illustrates the use of the if-hyper-resolution rule. 

Example 1. We consider the following set of clauses: 

1 {~'(j>{f{x)) = true),r{x) = true} 

2 {f{a) = b}UR 

3 p{b) = true 

Let us apply the if-hyper-resolution on clause 1. First, the EC -unification 
procedure is called on the equation p{f{x)) = true. Here the conditional nar- 
rowing rule can be applied on p{f{x)) using clause 3. We obtain two distinct 
problems: 

— {true = true, f{x) = b}; 

— or {p(/(x)) = true} with the condition f(x) b. 

The second problem can be deleted, since narrowing cannot be applied any- 
more at root position in the term p{f{x)), and since p{f{x)) is not unifiable with 
true (lines 23 and 35 of the EC -unification procedure). Hence, it only remains 
to solve the first problem. The first equation is trivial and can be deleted imme- 
diately (a particular case of the decomposition rule, lines 33-34) . For solving 
the equation f{x) = b, we can again apply the narrowing rule on /(x) = b and 
clause 2. We obtain the following problem (the other branch can be immediately 
deleted since f{x) is not unifiable with b): 

{b = b,x = a}, with the condition {i?}. 

b = b is valid, and x = a can be solved immediately by replacing x by a 
(replacement rule, lines 18-20), hence we obtain the following solution: {x -y a}, 
with the condition R. 

Therefore, the if-hyper-resolution rule can be applied and gives the clause: 

r(a) V R 

Notice that this clause may have been generated by 2 steps of paramodulation 
into clause 1, followed by one step of hyper-resolution between the obtained 
clause and the reflexivity axiom x = x (needed when paramodulation is used). 
The if- hyper-resolution allows to merge these 3 steps in a single rule. More 
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important, it also allows to restrict the search space: if we replace for example 
the clause p{b) = true by p(/(a)) = true, then the if-hyper-resolution would not 
be applicable any more. Indeed, the reader can easily check that the instance 
p(/(a)) of p{f{x)) would still be rewritten as p{b) hence would not be unifiable 
with p{f{a)). This does not threaten refutational completeness, since p{b) V 
R may actually be generated from p{f{a)) and /(a) = 6 V i? by using PO- 
paramodulation. 



Theorem 1. The calculus defined by the E-hyper-resolution (with the EC -uni- 
fication procedure) and positive ordered paramodulation is sound and refutation- 
ally complete, i.e. for all sets of clauses S, S is unsatisfiable iff there exists a refu- 
tation from S using only the E -hyper-resolution and positive ordered paramodu- 
lation rules. 

Proof. Soundness. The soundness of the positive ordered paramodulation rule 
is well known. We only have to prove that the i?-hyper-resolution is sound. 
To this purpose, it suffices to show that for all set of equations P = {ti = 
Xi,si = Xi,...,tn = Xn,Sn = Xn}, if EC -unification{S , P) = {0,E) then we 
have S, -•E |= P9. Indeed, in this case, the clause C9 U E must be a logical 

consequence of the clause S' U {ti = si, . . . , = s„ — >■ C}. 

We denote by Pn the set of equations obtained at set n, by the set of 
atoms, by C„ the corresponding set of conditions, and by the corresponding 
substitutions. We show by induction on the number of iterations, that at each 
step, and for any model I of S U we must have X ^ S'6*„ if I ^ P9n. 
When the procedure terminates, we must have P = T hence X must be a model 
of S'9n. 

— Base case. The proof is immediate, since Pq = S'. 

— Inductive case. Assume that the property holds at step n. We show that 

it holds at step n -\- 1. It suffices to show that S, \= (P„6*„+i <^= 

Pn+i9n+i). Let t = s be the equation selected by the procedure at step 
n -I- 1. We distinguish several case, according to the form of t, s. 

• If t (resp. s) is a variable, then we have, by definition, = 9nD{t — >■ s} 

and = Pn\{t = s}. Hence the proof is immediate. 

• If t = f{ti, . . . ,tn) and there exists a clause /(si, . . . , s„) = t" U R G S 
such that /(si, . . . , s„) = t" > R, f{s\, . . . , s„) > t" then we distinguish 
two cases according to the chosen OR branch. 

1. Rewriting branch. We have 0„+i = 0„, P„+i = P„\{(t, s)}U{t", s}U 
Ur=i{^* = En+i = P„UP. Let CT be a ground substitution of 

P„_i_i0„_|_i and let P be a model of S, P„+i0„+i. We must have 

X 1= -iP and X \= /(si, . . . , s„) = t" \J R hence X |= /(si, . . . , s„) = 
t" . Moreover, we have Vt G [l..n],P \= {ti = Si)0„+icr. Hence X \= 
/(si,...,s„) = f{ti,...,tn)9n-\-i<J. Therefore X \= {t = t")6»„+icr. 
Since we must have X ^ {t" = s)0„+i(j, we deduce that X \= {t = 
s)9n+ia hence that X |= P„0„+icr. Therefore, we have S, ->En+i |= 
{PnO n+1 ^ Pn+l 
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2. Non-rewriting branch. The proof is immediate, since Pn+i = Pn and 

^n+l — ^n- 

• If t = f{ti, . . . ,tn) and s = /(si, . . . , s„). The proof is immediate, by 
soundness of the decomposition rule. 

• If t = /(ti, . . . , tn) and s = g{si , . . . , Sm)- The proof is immediate, since 
there is no solutions. 

Refutational completeness. We first have to show that the EC -unification 
procedure terminates. By definition of EC -unification, any term occurring in Pn 
at any step n is either a subterm of a term occurring in S' or a ground term 
occurring in S (since any positive clause in S is ground). Therefore, the number of 
distinct clauses of the form RV Vr=i /(si, • ■ ■ , Sn) = t”V R G S 

and /(ti, . . . ,tn) occurs in Pn is finite. Therefore, we may assume, w.l.o.g. that 
no new clause is added into C. 

Then, we consider the following measure on the set of equation 
I(P) = = s),t= s)\{t,s) G P} 

where s(t = s) denotes the number of variables in t = s. Equations are or- 
dered using the ordering < and P{P) is ordered using lexicographic and multiset 
extensions of the orderings on s{t = s) and t = s. 

Then at each step, it is easy to see that either I{Pn) decrease strictly. Indeed, 
either a variable is instanciated and v decreases strictly, or the value of the terms 
in t = s decreases strictly w.r.t. <. 

Now, we prove that the P-hyper-resolution and ordered positive paramodu- 
lation rules are refutationally complete, using the technique introduced in ini, 
namely the transfinite tree method. 

Let A be the set of ground literals built on the signature E, =. Clearly, (A, <) 
is isomorphic to an ordinal uj. For any ordinal a < w, we denote by Aa the literal 
corresponding to a. 

We build — by transfinite induction — a sequence of partial interpretations 
Xa (i.e. of partial functions from A into {true, false}) as follows: 

— Base case. Xq = 0. 

— Limit ordinal. X^ = U/3<o!^/3 if ct is a limit ordinal. 

— Successor case. Iq+i is defined as follows. 

• If Aa = {t = t) then Xa+i =XaGI {(t = t) — >■ true}. 

• If Aa = L[t\p and there exists a literal t = s such that Xa(t = s) = true 
and t > s, then Xa + l{L[t]p)=Xa{L[s]p). 

• If Aa = L and there exists a positive clause L' y R and a ground substi- 
tution a such that L = L'a and Xa ^ ~'R then Xa-^-i{L) = true. 

• Else Xa{L) = false. 



Remark 5. The reader should note that the sequence {Xa)a<bj is monotonic i.e. 
for all /3 < a Xa is an extension oiXp. 
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Assume that □ ^ S' and that S is irreducible by £l-hyper-resolution and 
ordered positive resolution. Then, we are going to show that \= S. We denote 
by Sg be the set of ground instances of clauses in S. 

Assume that S. Then, there exists a clause C and a ground substitution 
cr such that \= ~'Ca. W.l.o.g. we assume that C, a are chosen in such a way 
that Ca is the smallest clause (w.r.t. <) having this property. 

— Assume that C is positive. Ca is of the form LM R where ^ -li?. By 
construction of there must exists a term t' occurring at position p in L 
such that validates an equation t' = s' (else X^ would necessarly validate 
the maximal literal in Cct which is impossible). Therefore, there must exist 
a term t' and a clause t' = s' V R' € Sg such that X^^ ^ (t' = s') V R' , t' > s', 
{t' = s') > R' and t' occurs at a position p in L. By irreducibility w.r.t. the 
ordered positive paramodulation rule, this implies that Sg contains the clause 
L[s']p V RV R'. But we have X,^ ^ L[^']p X RV R' and L[s']p V i?V i?' < Ca. 
Hence this is impossible. 

— Therefore, C must contain at least a negative literal. C is of the form 
Vr=i ^ SiV R where R is positive, ti, si, . . . , s„ are terms, n > 1. 

We are going to show that the application of the A-hyper-resolution rule 
must yield a clause of the form Ra U E, where ^ E. 

By definition of the if-hyper-resolution rule, we have to show that the ap- 
plication of the EC -unification procedure on the input S,{ti = xi,s\ = 
xi, . . . ,t„ = Xn, Sn = Xn} yields the output {a', E), where a' = a U {xi — >■ 
tia} is an extension of a and X,^ ^ E. 

We define the following strategy for the choice of the OR-branch. The 
first branch is chosen iff R is falsified by X,^ and X^ ^ /(fi, . . . , t„)cr = 
/(si, . . . , s„). Else the second branch is chosen. 

Then, by definition, at each step, any atom in E must be false in X^^. More- 
over, we prove (by induction on n) that at each step n, we have: H C„cr' 

and X^ ^ Pji9ri+\a Pji-\.\9jij,.\a . 

Let t = she the equation considered at step n. We distinguish several cases, 
according to the form of t = s. 

• If t or s is a variable then the proof is immediate. 

• If t = f{ti, . . . ,tn) and there exists a clause /(si, . . . , s„) = t" U R € S 
such that /(si,...,s„) = t" > R, /(si,...,s„) > t" , ~^En,^R,C„ ^ 
Ar=i ti = Si- Since /(si, . . . , s„) = t"VR is positive, it must be validated 
by To;- 

1. If the first OR-branch is chosen, then this means — by definition 

on the chosen strategy — that we must have X^^ ^ /(sij • ■ • > Sn) = 
t" (since X,,j \= ~^R and X^^ \= f{s \, . . . , s„) = t" V R) and X,^ \= 
{ta' = /(si, . . . ,s„)). Hence if X,^{t = s)a' then \= {t" = s) A 
(A"=i = Si). Therefore X^^ ^ (P„0 n-t-l ^ Rn-t-l^n-t-l)- 

2. Else, we must have either X^^ |= i? or X^^ \= /(si, . . . , s„) yf ta', hence 
X^ )= C„+icr'. 

• If t = f{ti, . . . , tn) and s = /(si, . . . , s„) and we are not in the previous 
case. Assume that there exists a clause /(s^, . . . , s(j) = t" U R £ S 
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such that = t" > R, f{s[,...,s'^) > t" and ^ 

f{s'i , . . . , sjj) = t” and ^ (ter' = f{s'i , . . . , s'„)). Then, by definition, 
we must have ~<En, ~'R,Cn \= t ^ , s'„). But this is impossible, 

since we must have X^^ ^ ~'En, ~'R,Cn and X^^ |= ta' = /(s'l, . . . , s'„). 
Therefore, there is no clause in S satisfying the above property. By defini- 
tion ofI(j, this implies that X^^ \= {t = s)a' iff Vi G [l..n],X^ ^ tia' = Sj. 
Hence we must have X^^ ^ (P„0„+i =i> P„_|_i0„+i). 

• If t = /(ti, . . . , tn) and s = g{s \, . . . , Sm) and if there is no clause in S 
satisfying the above property. The proof is similar to the previous case. 
Therefore, since X^ ^ (UiLi(^* ~ Xi A Si = Xi)a' , we deduce that 
EC -unification must have a solution {9, E) and that X^^ Y= E. Then, by ir- 
reducibility w.r.t. the P-hyper-resolution rule, this implies that RaUE G S 
hence (since R and E are positive and ^ E) that X^^ ^ Ra. Since 
Xi^ C a this is impossible. 

Therefore, the P-hyper-resolution rule is refutationally complete. 

Using range-restricted clauses together with P-hyper-resolution and ordered 
paramodulation has important consequences: in particular, only ground positive 
clauses may be deduced, hence the efficiency of the usual algorithms for indexing 
terms and clauses, and checking redundancies can be improved drastically. For 
example, checking forward and backward subsumption (which is well known as 
a difficult and costly problem in the general case) becomes very easy: it suffices 
to compare the two lists of literals. If the clauses are sorted (w.r.t. <) before 
being stored into the database (which can be done in n x ln{n) where n is the 
length — i.e. the number of literals — of the clause), subsumption tests may be 
done in linear time. 

P-hyper-resolution is especially useful when the number of equational literals 
is not very important w.r.t. the size of the clause set, which is the case in most 
of the problems we have to deal with. 

Remark 6. The fact that the clauses are range-restricted is essential for the ter- 
mination of the EC -unification procedure, hence for the refutational complete- 
ness of the method. 

2.3 Mixing Up Backward and Forward Proof Search 

HOARD atinp has been enriched with a special feature allowing to combine E- 
hyper-resolution with a form of top-down reasoning (as it is performed for ex- 
ample by Model-Elimination-based theorem provers or by SLD-resolution and 
by Prolog). 

This is done by adding a special kind of (equational) constraints to the 
clauses. We consider constrained clauses of the form [C | T] where C is a clause 
(in the standard sense) and X a conjunction of equations of the form t = s. C 
is the clausal part of [C | T] and X is the constraint part. 

The PO-paramodulation rule can easily be extended to constrained clauses, 
as follows. 
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Constrained Positive Ordered paramodulation: 

[{t = 4 u i? I df] [{L[t]p} uR'\y] 

[{L[s]p}U RU R' \X hy] 

where i?, i?', L[t]p are positive, t > s, t = s > R, L[t]p > R', u[t]p > v. 

From a semantic point of view, a constrained clause of the form [C \ Vi=i = 
Si] is equivalent to the clause V^i ti ^ SiW C. However, the constraint will 
not be handled in the same way than the clausal part by the EC -unification 
procedure. More precisely, instead of including as usual this condition into E, 
the procedure will include it into the set of considered equations. Therefore, 
the EC -unification procedure will evaluate all the equations belonging to the 
constraint part of the clause in order to find solutions to these conditions, before 
deducing the corresponding if-hyper-resolvent. This is done recursively, i.e. the 
evaluation of the constraint may lead to the consideration of other constrained 
clauses, hence may entail another application of the EC -unification procedure. 

For this purpose, the condition “If 3/(si, . . . , s„) = t" U i? G S'” in the 
EC -unification procedure (line 23) must be replaced by “If 3[/(si, . . . , s„) = 
t” iJ R I X] G S” and the following additional instruction has to be added to 
the EC -unification procedure, just before EndWhile (line 37), in order to deal 
with the constraint part of the clause: 

V — VUX 

This technique allows to integrate in a very natural way useful features of 
backward reasoning (in particular the building of goal-oriented derivations) into 
the EC -unification procedure. This strongly reduces the number of generated 
clauses, with reasonable computation cost. 

A drawback of this technique is that the EC -unification procedure may not 
terminate, hence refutational completeness may be lost. In order to avoid non- 
termination in some particular cases, the system keeps track of the goals pre- 
viously considered in order to avoid entering into infinite loops. Obviously, this 
cannot prevent non-termination in the general case. It is up to the user to choose 
carefully which equational conditions should be treated by a backward search. 
Currently, we use it only on particular axioms for which termination is guaran- 
teed. More flexible strategies, including automatic analysis of the clause sets will 
be considered in the future. 

There are two points for which the use of backward reasoning has proven to 
be particularly useful. 

— Definitions of geometric objects. Constrained clauses are especially 
useful for defining objects. For example the definition: “a parallelogram 
is a quadrilateral {A, B,C, D) such that (A,B) || (C,D) and (B,C) || 
(A, £>)” can be translated into the following (non range-restricted!) clause: 
[{parallelogram{{A, B,C, D)) = true} \ parallel{{A,D),{B,C)) = true A 
parallel{{A,B),{C,D)) = true] Then, the mechanism described above 
will enforce the prover to dynamically replace, during proof search, each 
atom of the form parallelogram{{A, B,C, D)) = true by the conjunction 
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parallel{{A, D) , {B , C)) = true A parallel{{A, B) , {C , D)) = true (this tech- 
nique can be seen as an extension of demodulation for non equational liter- 
als). 

— Prolog call. A special predicate “prolog_call” has been introduced: it allows 
to take advantage of all built-in Prolog predicates (for example for arithmetic 
computations). Unifying prolog -call{G) and true during the unification pro- 
cedure will simply cause the evaluation of the goal G using Prolog engine. 
Thus, adding the constraint prolog -call{G) = true to a clause, will make the 
system evaluate the goal G before applying the if-hyper-resolution rule on 
this clause. 

Remark 7. The prolog_call predicate is especially useful for including numeri- 
cal computations into the calculus, or for combining HOARD atinp with existing 
systems (see Section 6). 

2.4 Handling Theories 

We analyze in this section some theories usually underlying geometric reasoning. 
In order to efficiently obtain user-oriented proofs they must be incorporated into 
the proof steps. As expected, experiments have confirmed that symmetry is one 
of the most important among these theories. These examples show the way of 
incorporating new theories in HOARDatinp- 

Equational theories such as commutativity (i.e. [A, B] = [B, A] or {A, B) = 
(B,A)) or circularity (i.e. {A, B,G, D) = (B,G,D,A)) are encoded in the core 
of the unification algorithm. More precisely, the decomposition rule is replaced 
by the following ones, that are specific to some particular functional symbols. 
The EG -unification procedure is modified accordingly: 

f{ti,t2) = fiAA) (ti = A A t2 = A) V (t2 = A V u = A) 

if / is commutative 

f{ti,t2,t3,u) =fiA,A,A,t4) -s- = A Ab+i = A au+2 = A ag+s = A) 

if / is circular and with tj = tj-i if j > 4 

When a clause is generated, the system computes the minimal (w.r.t. <) 
representative of its equivalence class according to the above properties (com- 
mutativity and circularity), and only this particular representative (rather than 
the clause itself) is stored into the database {this is possible since all generated 
clauses are ground). This mechanism significantly speeds up redundancy checks 
(subsumption tests can still be performed in linear time). 

Moreover, special deletion rules are also added, in order to remove clauses 
that do not carry any geometric information, for example clauses containing 
terms of the form [t, s] or {t,s), where t = s {it can easily be shown that this 
does not affect refutational completeness). 

These techniques allow to significantly reduce the number of clauses convey- 
ing exactly the same information. 

Moreover, non-equational theories such as parallelism and collinearity are 
also treated by a special mechanism, based on the use of constrained clauses. 
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More precisely, a new function symbol “dir’’ {dir standing for “direction”) of 
arity 1 is introduced. Its intended meaning is that dir{d) is the equivalence class 
of the line d w.r.t. the ^‘paralleF relation. Therefore, checking whether two lines 
{A, B) and (C, D) are parallel will be done very easily by checking whether 
dir{{A,B)) = dir{{C, D)). Checking whether the points A,B,C are collinear 
is done by checking (for example) that dir({A,C)) = dir((A, B)). This avoids 
adding axioms for the transitivity of the parallelism relation and allows to deal 
with this theory in a more effective way. 

The following clauses are introduced into the clause sets: 

[parallel{{A, B), {C, D)) = true \ dir{{A,B)) yf dir{{C,D))] 

[dir{{A, C)) = dir{A,B)) V dir{{A,B)) yf dir{{B,C)) \ true] 

[collinear { A, B,C) = true \ dir{{A, B)) yf dir{{A,C))] 

[dir{{A,C)) yf dir{A, B)) V {A,B) = {A,C) \ true] 

Another special mechanism, based on the use of ordering constraints, has also 
been added in order to handle clauses that are symmetric w.r.t. a permutation 
of the variables. For example, consider the clause: 

C : perpendicular{x, y) A perpendicular{x , z) — >■ parallel{y, z) 

and the permutation a = {y ^ z, z ^ y}. 

It is easy to see that cr is a symmetry for clause C, since Ca is syntactically 
equivalent to C modulo the commutativity of || and V. Thus, given the two clauses 
perpendicular{a, b) and perpendicular{a, c), two distinct clauses will be generated 
by applying the A-hyper-resolution rule: parallel{b,c) and parallel{c, b) . 

Obviously, these two clauses are equivalent modulo commutativity of ||, hence 
one of these two clauses will be actually deleted by the subsumption algorithm. It 
would be more efficient to prevent the generation of such clauses, instead of dis- 
carding them afterwards. This is done by introducing special kind of constraints 
into the clauses. These constraints express ordering conditions on the variables, 
in order to “destroy” the symmetry, thus preventing the generation of equivalent 
clauses. Here, it suffices to add the constraint x < y, ensuring that x must be 
lower than y according to some (arbitrarily chosen) ordering < in the initial 
clause. This prevent the generation of parallel{b, c) (if 6 > c) or parallel{c, b) (if 
b < c). 

Unification problems are solved modulo these ordering constraints, i.e. the 
system check that the obtained solutions satisfy the constraints. As in Constraint 
Logic Programming, the constraints are solved as soon as the value of the vari- 
ables are known. This allows to decrease strongly the cost of the unification 
algorithm (since several solutions may be discarded) and the number of deleted 
clauses without too much additional computation cost. 

The addition of such ordering constraints may be done automatically. Indeed, 
it suffices to add for each clause C and for each pair of variable (x, y) such that 
C{x ^ y} = C the constraints x < y {or y < x). 

As it is well known, adding such special mechanisms devoted to the handling 
of the considered theories is essential for improving the efficiency of provers. This 
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is of course also the case for HOARD ^tinf- Without them, no proof would be 
obtained in reasonable time for most of the considered problems. 

2.5 Axiomatization 

Our goal is to produce human-oriented proofs, and, in the framework of this 
work, students oriented-proofs. Therefore, complete, but complex axiomatiza- 
tions such as Tarski’s or Hilbert’s, would be useless. 

The axiomatization will mainly depend on the level of knowledge of students. 
It will contain all the geometric axioms that are supposed to be known at a 
certain stage. In order to build our system such geometric axioms have been 
gathered from school textbooks (of the adequate level) and translated into the 
considered fragment of first-order logic. 

Moreover, several axioms have had to be added in order to formalize “default” 
(or “implicit”) knowledge freely used in textbooks, mainly because it is either 
trivial or common sense. Such axioms includes for example basic properties of 
geometric objects (i.e. {A,B) = {B,A)) or basic definitions (i.e. the definition 
of the midpoint of a segment) or trivial reasoning steps about geometric object 
(i.e. if / G (A, B) and B G (A, C) then / G (A, C)). This implicit knowledge has 
been added either as built-in features or formalized as set of first-order clauses. 

This axiomatization is obviously not complete w.r.t. Euclidean plane geom- 
etry, but completeness is not really relevant for our purpose (but soundness is, 
of course, an important issue). 

The axiomatization that we used for all the examples in this paper contains 
141 clauses. 

2.6 An Example 

Remark 8. It is worth noticing that despite numerous attempts, using different 
strategies, no proof of the simple theorems stated in Examples El El and 0 has 
been obtained with the well-known generic theorem prover OTTER (of course 
we have used the same axiomatization). OTTER ran out of memory after some 
hours of computation having generated thousands of clauses. 

Remark 9. All the examples treated by HOARD atinp given in this paper run on 
a Pentium 200 with 64 Mb of memory. 

Example 2 (Problem proposed to 15 years old French high school students). We 
consider the following problem (see Figure|3): Let A, B,Che a, triangle such that 
\AC\ = \BC\. Let I be the middle of [B,C], J be the middle of [A, C] and K 
the middle of [A, R]. Let E and F the middle of [A, A] and [K,B] respectively. 
Prove that (A, J, /, F) is a rectangle. 

HOARD atinp computes a proof automatically in about 20 seconds 217 clauses 
are generated and 2432 are generated and discarded by forward subsumption. 

The proof (in pseudo-French) is given in Annex A. Note that several of the 
considered steps are simple applications of basic definitions that are trivial for 
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Fig. 2. A figure for Example |3 



human beeings, and thus omitted in human’s proofs (for example E is the mid- 
points of (A,K) and (K,E) implies that (A,K) || {K,E)). Of course, these 
steps could have been removed by the system as well, but we prefer to make 
them explicitly for the sake of completeness. 

A more careful analysis of the proof also shows that several subparts are 
highly similar. They correspond to applications of the same sequence of geomet- 
ric theorems on different points. In order to reduce the size of the proof — thus 
making it more readable — other techniques are necessary to detect such sim- 
ilarities. We have already developed these techniques, see Section |^1 (Remark 
inj for more details. 

Examples. We consider a parallelogram (A,B,C,D) such that d{A,C) = 
d{B,D). Let E,E,G,E[ the midpoints of [A, B], [B,C], [C\ D], [D,A] respec- 
tively. Prove that {E, E, G, El) is a square. 

The proof of this theorem generated by HOARD atinf is given in Annex B. 

The next example (taken from 0) is interesting because its proof needs 
the introduction of new auxiliary points. It allow to compare features of 
HOARD atinf and GEX on a running example. 

Example 4 (taken from page 240, originally given in m)- 

We consider a trapezoid (A,B,C,D) with (A,B) || {G,D). M,N are the 
midpoints of [A, G] and \B, D] respectively. E is the intersection of (M, N) and 
{G,B). Prove that E is the midpoint of [G,B]. 

In order to construct the proof, the midpoint of [A, D] must be constructed. 
In our formalism, this entails the use of a clause corresponding to a constructive 
axiom, of the form: 

~'point{A) V -ipoint{B) V midpoint{m{A, B), [A, B]) 

This axiom comes from the skolemization of the following formula: 

~^point{A) V -ipoint{B) V {3M)midpoint{M, [A,B]) 
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m is the skolem function introduced from (3M). 

This technique avoids explicit generation of auxiliary points (as in GEX |^). 
See Section 0 for more details about this problem. 

However, it is not reasonable to keep such skolem terms in the proof pre- 
sented to the user, because they will darken the proof. Therefore, HOARD atinp 
eliminates them after the proof has been generated and automatically replaces 
the introduction of these terms by the explicit construction of new points (using 
anti-skolemization techniques) . The proof it gives is the following (the translation 
into pseudo-french is too verbose, we do not give it). 

Note that non-degeneracy conditions are proven here and not assumed from 
a given figure (see Section EJ. This explain the difference in length between 
the HOARD atinp ’ s and GEX’s proofs. As previously mentioned for “implicit” 
axioms, the corresponding parts of the proof need not to be explicitly displayed 
to the user, which makes the proof shorter and more readable. 

there_exists (p7 .midpoint (p7, segment (D, A) ) ) 

$ 

midpoint (N , segment (D , B) ) 

$ 

parallel (segment (p7 ,N) , segment (A.B) ) 
midpoint (p7 , segment (D , A) ) 
midpoint (N , segment (D , B) ) 
midpoint s_theorem 
$ 

non(D = A) 

non(collinear(D, A.B) ) 

$ 

non(p7 = D) 

midpoint (p7 , segment (D , A) ) 
non(D = A) 

$ 

collinear (N,D,B) 
midpoint (N , segment (D , B) ) 

$ 

collinear (p7 , D , A) 
midpoint (p7 , segment (D , A) ) 

$ 

non(p7 = N) 
non(p7 = D) 
collinear (N,D,B) 
collinear (p7 , D , A) 
non(collinear(D, A.B) ) 

$ 

parallel (segment (p7 ,N) , segment (D.C) ) 
parallel (segment (p7 ,N) , segment (A.B) ) 
parallel (segment (D.C) . segment (A.B)) 
transit ivity_parallelism 
$ 

parallel (segment (p7 .M) . segment (D.C) ) 
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midpoint (p7 , segment (D , A) ) 
midpoint (M, segment (A,C) ) 
midpoint_theorem 
$ 

parallel (segment (p7 ,N) , segment (p7,M) ) 
parallel (segment (p7,N) , segment (D,C) ) 
parallel (segment (p7,M) , segment (D ,C) ) 
transit ivity_parallelisme 
$ 

collinear (p7 , N , M) 

parallel (segment (p7,N) , segment (p7,M) ) 

$ 

droite(p7,N) = droite(N,M) 
non(N = M) 
non(p7 = N) 
collinear (p7,N,M) 

$ 

non(D = B) 

non(collinear(D, A,B) ) 

$ 

non(N = B) 

midpoint (N , segment (D , B) ) 
non(D = B) 

$ 

non(collinear(D,C,B) ) 

$ 

non(N = E) 
non(N = B) 
collinear (C,B,E) 
collinear (N,D,B) 
non(collinear(D,C,B) ) 

$ 

droite(N,M) = droite(N,E) 
non(N = E) 
non(N = M) 
collinear (N,M,E) 

$ 

non(A = C) 

non (coll inear ( A, C,B) ) 

$ 

non(M = C) 

midpoint (M, segment (A,C) ) 
non(A = C) 

$ 

collinear (M , A , C) 
midpoint (M, segment (A,C) ) 

$ 

non(M = E) 
non(M = C) 
collinear (C,B,E) 
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collinear (M , A , C) 
non(collinear(A,C,B) ) 

$ 

droite(N,M) = droite(M,E) 
non(M = E) 
non(N = M) 
collinear (N,M,E) 

$ 

droite(N,E) = droite(M,E) 
droite(N,M) = droite(M,E) 
droite(N,M) = droite(N,E) 

$ 

droite(p7,N) = droite(M,E) 
droite(p7,N) = droite(N,M) 
droite(N,M) = droite(M,E) 

$ 

parallel (segment (M,E) , segment (A,B) ) 
parallel (segment (p7,N) , segment (A,B) ) 
droite(p7,N) = droite(M,E) 

$ 

midpoint (E , segment (C , B) ) 
collinear (C,B,E) 
non(collinear(A,C,B) ) 
parallel (segment (M,E) , segment (A,B) ) 
midpoint (M, segment (A,C) ) 
midpoint _theorem 
$ 



3 Some Additional Features 

In this section, we describe some other original features of HOARD atinp and we 
give examples of applications of these features. Some of the ideas behind these 
capabilities have been proposed in the field of automated deduction since many 
years ago and some techniques have been presented (Bledsoe, Bundy, . . . ), but 
the techniques proposed here are original in a large extent and, as far as we 
know, combined for the first time. 

3.1 Proof Verification 

HOARD atinp can be used to check the alleged “proofs” built by the students, 
in order to detect incorrect (in particular incomplete) “proofs” . However, the 
system does not merely check the soundness of the proof is the standard sense. 
Indeed, it is well known that human beings do not in general write “complete” 
proofs: several steps are usually considered implicit, often because they are triv- 
ial, and also because making them explicit would hide the main ideas and reduce 
proof readability. 




288 Ricardo Caferra, Nicolas Peltier, and Francois Puitg 



Proof check is performed as follows: 

— In the tutorial system CABRI-EUCLIDE (see Introduction), quite naturally, 
each step of the proof must be specified by a partial conclusion C, a set of 
hypothesis 'H and a geometric theorem (or definition) T. 

— In order to check that the step is correct, HOARD ^tinp tries to compute a 
proof of C given 'H, using both the set of axioms corresponding to T and the 
“default” (“implicit”) axioms i.e. the theory that are “well known” in the 
state of the student’s knowledge (see Section ISJ. 

— If a proof is found, then the proof step is validated. Otherwise, further anal- 
ysis may be performed in order to find an explanation of why the proof is 
not correct. For example: 

• Checking whether other theorems can be used for deriving the desired 
conclusion. 

• Try to identify missing hypotheses that could be used to prove C. 

• Construct a partial counter-example if the step is incorrect. 

Example 5. We consider the following proof step (taken from a student’s “proof” 
built using CABRI-Euclide |2H)- 

“{A, B,C, D) is a parallelogram, I is the midpoint of [B,D], Hence I is 
the midpoint of [A, C] (since the diagonals of a parallelogram intersect in their 
middles)” . 

Here the assertion “I is the midpoint of \A, C]” cannot be deduced from the 
hypothesis. The proof step is not correct (sound) since the conclusion is not a 
logical consequence of the premises. 

The system tries to computes (i.e. to abduce) a set of additional hypothesis 
allowing to prove the desired result. This is done by backward chaining, starting 
from the conclusion and trying to compute a sufficient set of hypothesis (w.r.t. a 
given set of axioms). Here it detects that the hypothesis -^eollineardA, B, C, D}) 
(i.e. a nondegeneracy condition) is sufficient to complete the proof. Depending 
on the context, the system could accept the proof step as it is, point out the 
missing hypothesis to the user, check whether the missing hypothesis is valid or 
not, etc. 

Remark 10. -icollinear{{A, B, (7, D}) is an example of hypothesis which is often 
omitted in human proofs. 

3.2 Model Building 

When the validity (respectively contradiction) of a formula cannot be proved, 
it is very useful to get a counter-example (respectively model) of it, in order 
to provide a eonvincing evidence of the non-validity (non-contradiction) of the 
formula. 

In this section, we show how to extract models from clause sets that are 
saturated (i.e. no new clauses can be derived) under E-hyper-resolution. This 
technique can be seen as a generalization of the method proposed in [23r22j for 
a particular class of clauses. 
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Procedure ModelBuilding 
INPUT A set of clauses S 

OUTPUT A set of ground equations E such that E \= S 

Begin 

Let U := 0 

Let = {C e 5 I C = Vr=i ti = Sz} 

While Se^9 
Begin 

Let C := min(S'£;) % w.r.t. the multiset extension of the order < 

Let L := max(C') % w.r.t. the order < on literals 
E := EU{L} 

Se = {C G S \ C = vr=i^i = ^ ^E sj 

End 

End 

Theorem 2. Let S be a set of clauses saturated under E -hyper-resolution and 
ordered paramodulation together with standard deletion rule: (conditional) de- 
modulation and subsumption. Then ModelBuilding{S) terminates. Moreover, 
ModelBuilding(S) is a model of S. 

The models constructed by the ModelBuilding procedure are Herbrand mod- 
els (i.e. defined on the set of ground terms). Obviously, it is more useful — in 
the particular context of geometry — to construct diagrams as models. Hence, 
we have added a special mechanism to transform automatically Herbrand mod- 
els into geometric diagrams. This entails numerical solving of sets of equations 
and inequations, since we have to find the value of the coordinates such that 
all the assertions in the Herbrand model holds (i.e. we must find a solution of 
a system of equations and inequations). To this purpose, we use MAPLhfl, to- 
gether with specific heuristics devoted to numerical solving of inequations. The 
pre-generation of Herbrand model (or at least of a partial Herbrand model) is 
useful (and often crucial) to guide the numerical solver. 

Example 6. Assume we want to build a counter-example for the wrong “proof” of 
Example 0 We have to find a model satisfying “{A, B, C, D) is a parallelogram” , 
“I is the midpoint of [H,U]” but not “I is the midpoint of [A,C]’\ 

During the generation of the Herbrand model, we deduce in particular the 
assertion: “collinear{{A,B,C,D}y^ which is a logical consequence of the set of 
hypothesis. Then, all these assertions are translated into sets of equations and 
inequations. We obtain: 

1 Xi = {Xb -t Xd)I‘2 i.e. I is the midpoint of [B,D] 

2 Yi^{Yb+Yo)/2 

3 Xa — Xb = Xd — Xc i.e. A, B, C,D is a. parallelogram 

4 Ya-Yb=Yd-Yc 

5 Xj A {Xa -f Xc)j2 i.e. I is not the midpoint of [A, C\ 

^ HOARD atinp has not yet integrated a symbolic computation system, MAPLE is 
used here in an ad hoc manner. 
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6 Yi^{Ya + Yc)!2 

7 (Xa - Xb) X {Ya - Yc) = (Xa - Xc) x {Ya - Yg) i.e. colhnear({A, B, C}) 

8 {Xa - Xb) x (Ya - Yd) = {Xa - Xd) x {Ya - Yg) i.e. collmear{{A, B, D}) 

We use MAPLE to solve the system {1,2, 3,4,7, 8}. We obtain a parame- 
terized solution of the system, that can be instantiated by arbitrary values in 
order to find the coordinates corresponding to a diagram which is the graphical 
realization of the obtained Herbrand model. Notice that this would not have 
been possible if the assertion “ collinear{{A, B, C, 7?})” had not been added into 
the set of hypothesis. Indeed, 1,2, 3,4 do not carry enough information to find 
a solution satisfying 5,6. 

3.3 Proof Generalization and Analogical Reasoning 

The analysis of the concepts of analogy and reasoning by analogy has a very long 
tradition in western Philosophy and Mathematics. Analogy seems to be one of 
the most important (informal) techniques used by humans, particularly when 
dealing with mathematical problems (see e.g. psi). Consequently, application of 
analogy to the field of automated deduction seems quite natural. However work 
in this direction is surprisingly scarce albeit the ignorance of analogy by theorem 
provers was early identified as a major drawback for their performances [3| and 
reasoning by analogy is considered as a challenge in automated deduction in 
the Wos’ well-known list |HS|. Of course, a good use of analogy is still a very 
important issue in automated deduction. 

In the present work, we focus our interest on the intrinsic value of analogical 
reasoning for a better understanding of proofs, rather than in its uses to improve 
provers performances. 

The main goals are to be able to verify (possibly proposing some changes) 
the analogies proposed by students and to suggest proofs (or segment of proofs) 
analogous to those proposed by them. Analogy detection is especially important 
for the presentation and structuring of proofs: detecting analogies between parts 
of a proof is a way of introducing lemmata, thus making proofs shorter and more 
readable. Using analogy a priori can be considered as a form of planning. 

The analogy reasoning performed by HOARD atinp is mainly based on the 
method developed in nn for detecting and using analogies in resolution proofs. 
In this section, we briefly recall the basis of this approach, and we explain how 
it has been adapted to geometric reasoning. 

The method relies on generalization techniques. It can be divided into two 
steps: 

I. Generalization step. It occurs just after proving a (new) formula in a 
theory (i.e. a theorem). The formula is transformed into a more general 
formula schema. Predicate and function symbols are replaced by higher- 
order variables, and specific generalization rules are applied, in order to 
transform the considered formula into a more general one, while preserving 
the proof. The idea is to try to find a formula which is more general than 
the original one but that admit a “similar” (i.e. with the same structure) 
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proof. In some sense, the generalization algorithm infers, from a given proof, 
information useful for a larger class of problems. 

2. Matching step. Then the system tries to compare the problem at hand to 
the previously generalized formulae in order to detect potential similarities. 
If such similarities are only partial, then the system tries to infer lemmata 
that have to be proven in order to “complete” the analogy. The calculus for 
solving such matching problems has been presented in mni. It is based on 
higher-order unification techniques. 

We refer to lull, ‘^112! for a detailed presentation of this approach but the 
description we give here should enable the reader to understand how it works. 
The main modifications that have been introduced into the algorithms in order 
to deal more easily with geometry are the following: 

— First, it is clear that some of the function and predicate symbols are part 
of the language, hence should not be generalized at all (i.e. should not be 
replaced by higher-order variables). For example, it would not make sense, 
from a geometric point of view, to detect analogies by replacing the predicate 
^‘paralleF by the predicate ^‘’perpendiculad^ in a formula. Therefore, the ap- 
plication of the generalization rules (step 1) (see also must be carefully 
controlled. Only function symbols without any intended meaning should be 
generalized. Though this restricted generalization reduces the number of po- 
tential similarities, it also has the big advantage to strengthen the constraints 
on the matching problem (step 2), thus making the matching process much 
more easy. 

— Second, the matching algorithm should be performed modulo the particular 
theories of geometry predicates (commutativity, circularity and parallelism) 
as explained in Section o 

As a consequence of these changes in the algorithms, the next example (as 
well as the others in our experimentations) only uses a very weak version of the 
method presented in j1 1 1I .spi . 

Remark 11. Of course the full version of the algorithm could be useful, for exam- 
ple for detecting similarities between proofs in different geometries or in differ- 
ent axiomatizations of the same geometry. Indeed, abstracting the predicate and 
function symbols of the considered axiomatization could be useful for study- 
ing properties of these axiomatization, as well as properties of the considered 
geometric problems. 



Example 7. We consider the following source problem (P). “Let A,B,C,D be 
a quadrilateral. Let I, J, K, L be the midpoint of [A, B], [B, C], [C, D], [D, A] re- 
spectively. Prove that (/, J, K, L) is a parallelogram.” 

This problem is easy and well known. The proof built by HOARD atinf is the 
following: 
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midpoint (I , segment (A,B) ) 

$ 

midpoint (J , segment (B,C) ) 

$ 

midpoint (K , segment (C , D) ) 

$ 

midpoint (L , segment ( A , D) ) 

$ 

parallel (segment (I , J) , segment (A,C) ) 
midpoint (I , segment (A,B) ) 
midpoint (J , segment (B,C) ) 
midpoint s_theorem 
$ 

parallel (segment (L,K) , segment (A,C) ) 
midpoint (L , segment ( A , D) ) 
midpoint (K , segment (C , D) ) 
midpoint s_theorem 
$ 

parallel (segment (I , J) , segment (L,K) ) 
parallel (segment (I , J) , segment (A,C) ) 
parallel (segment (L,K) , segment (A,C) ) 
transit ivity_parallelism 
$ 

parallel (segment (L, I) , segment (B ,D) ) 
midpoint (I , segment (A,B) ) 
midpoint (L , segment ( A , D) ) 
midpoint s_theorem 
$ 

par allel( segment (J ,K) , segment (B ,D) ) 
midpoint (J , segment (B,C) ) 
midpoint (K , segment (C , D) ) 
midpoint s_theorem 
$ 

parallel (segment (I ,L) , segment (J ,K) ) 
parallel (segment (I ,L) , segment (B ,D) ) 
par allel( segment (J ,K) , segment (B ,D) ) 
transit ivity_parallelism 
$ 

par allelogr amme ( I , J , K , L) 
parallel (segment (I , J) , segment (L,K) ) 
parallel (segment (L, I) , segment (J ,K) ) 
parallelogram 
$ 



Then, we consider a new problem, noted {P')- “Let A,B,C be a triangle. 
Let P be a point. We construct the points Pi, P 2 , P 3 as the images of P, PI, P2 
respectively, by the symmetry w.r.t. the points A, B, and C respectively. Let I 
be the midpoint of [P, P3]. Prove that {A, B,C, I) is a parallelogramm.” 
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The interesting point here is that from a human point of view (P') is strongly 
similar (analogous) to (P). We use the system HOARDatinp in order to detect 
the analogy and reconstruct the proof of (P') from the one of (P). 

In a first step the proof of (P) is generalized. Then the system tries to match 
the new problem (P') with (P). In this case the matching does not succeed, 
since (P') is not a mere instance of (P). But a partial matching is possible. 
This matching can be transformed into a total one, modulo a set of lemmata 
corresponding to hypotheses of (P') that can not be matched to those of (P). 
We obtain the following list of hypotheses: 

midpoint (A , segment [XI , X2] ) . 
midpoint (B , segment [X2 , X3] ) . 
midpoint (C , segment [X3 , X4] ) . 
midpoint (I , segment [X4,X1] ) . 

It should be noted that XI, X2 , . . . , X4 are variables. In order to reconstruct 
the proof, we have to find values of XI, , X4 such that the properties above 
hold. 

The last hypothesis can be matched with midpoint (I, segment (P3,P), 
which leads to: 

midpoint (A , segment [P , X2] ) . 
midpoint (B , segment [X2 , X3] ) . 
midpoint (C, segment [X3,P3] ) . 

The remaining hypotheses cannot be matched. However, they can easily be 
proven from the original one, i.e. we have: 

midpoint (A , segment [P , PI] ) . 
midpoint (B , segment [PI ,P2] ) . 
midpoint (C, segment [P2,P3] ) . 

where the variables X2, Xi have been instanciated. Thus, we obtain the following 
proof: 

[partial] ’/, the analogy is only partial 
parallelogramm(A,B,C, I) 
parallel (segment (A,B) , segment (I ,C) ) 
parallel (segment (I , A) , segment (B ,C) ) 

$ 

parallel (segment (A,B) , segment (I ,C) ) 
parallel (segment (A,B) , segment (P ,P2) ) 
parallel (segment (I ,C) , segment (P ,P2) ) 

$ 

parallel (segment (A,B) , segment (P ,P2) ) 
midpoint (A, segment (P,P1) ) 
midpoint (B , segment (PI ,P2) ) 

$ 

parallel (segment (I ,C) , segment (P ,P2) ) 
midpoint (I , segment (P,P3) ) 
midpoint (C, segment (P2 ,P3) ) 
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$ 

midpoint (I , segment (P,P3) ) 

$ 

parallel (segment (I , A) , segment (B ,C) ) 
parallel (segment (A, I) , segment (PI ,P3) ) 
parallel (segment (B , C) , segment (PI , P3) ) 
$ 

parallel (segment (A, I) , segment (PI ,P3) ) 
midpoint (A , segment (P , PI ) ) 
midpoint (I , segment (P,P3) ) 

$ 

parallel (segment (B , C) , segment (PI , P3) ) 
midpoint (B, segment (PI ,P2) ) 
midpoint (C, segment (P2 ,P3) ) 

$ 

midpoint (A , segment (P , PI ) ) 
symetry (P,P1 , A) 

$ 

midpoint (B, segment (PI ,P2) ) 
symetry (PI ,P2 ,B) 

$ 

midpoint (C, segment (P2 ,P3) ) 
symetry (P2 , P3 , C) 

$ 



Remark 12. All these steps — generalization of the original problem, matching, 
instanciation of the variables, proof of the remaining lemmata and reconstruction 
of the whole proof — are fully automated. Computation time is rather short (less 
than Is). The interested reader may refer to | l If I df 1 2j for a detailed presentation 
of the algorithms. 

Remark 13. Analogy reasoning is also very useful for structuring proofs. For 
example in the proof of the Theorem given in Annex A, due to the obvious 
symmetry in the problem, the proof of (E,J) || (C,K) and {F,I) || (C,K) are 
analogous. Using our proof generalisation and matching algorithms in a system- 
atic way on these subproofs could allow to detect such similarities and avoid 
redundancies in proofs. Appropriate lemmata (obtained from the generalized 
subproof) could be automatically introduced in the proof. Obviously, a proof 
presented using lemmata is much more easy to understand and to interact with. 

4 Reasoning from Diagrams: An Example 

Using models and/or counter-examples for guiding proof search is a very natural 
idea that have been considered since the very beginning of Automated Deduction 
H3- It is especially useful in geometry where using diagrams to guide proofs (or 
refutations) is a common attitude among humans. 

A model can be used to check the validity of a given assertion in a particular 
context. Hence, it is often used to discard goals that cannot be proven, because 



Emphasizing Human Techniques in Automated Geometry Theorem Proving 295 



they are false in at least one model. This is especially useful for backward rea- 
soning, since it allows to prune the search space by deleting numerous subgoals 
that cannot be true in all models of the hypotheses, therefore cannot be a logical 
consequence of them (see for example El). 

Models (interpretations) can be also useful in forward reasoning. When look- 
ing for contradiction of set of formulas, if the set can be partitioned in formulas 
satisfied by the interpretation and formulas falsified in it, it is useless to deduce 
exclusively from formulas satisfied in the interpretation. This is the principle of 
the two most popular resolution strategies: support and semantic resolution. 

The set of support strategy m was born almost simultaneously with resolu- 
tion. The set of clauses is partitioned into a satisfiable one and a non-satisfiable 
one (a simple theorem ensures the existence of such interpretations for unsatisfi- 
able set of clauses). Specification of the interpretation is not required. The more 
successful approach for using semantic information in the context of resolution 
theorem proving is surely semantic resolution, introduced by Slagle in [23B1. 
Semantic resolution corresponds roughly, as Slagle pointed out, to the heuristic 
used in the Geometric Theorem-proving Machine uni to prune the proof search 
in plane geometry. It can be seen as an extension of the set of support strategy. 
The principle of semantic resolution is again to discard some clauses that cannot 
possibly lead to a refutation. This is done — roughly speaking — by preventing 
the application of the resolution rule between two clauses that are true in the 
considered interpretation. 

Therefore, in all semantic-based approaches, models play the role of a filter, 
and are used to cut some of the branches in the proof search. As such, this was 
proved to be a powerful and elegant way of guiding proof search and strongly 
improved the performances of theorem provers (see for example m for some 
striking examples). Macro inference rules such as positive or negative hyper- 
resolution may be seen as particular cases of semantic resolution. 

In IdOl models are used in two different ways: to prune subgoals that are false 
in the model (as in H51) and to suggest inferences. This second use is closer to 
what we do in our approach. Reiter also uses the state of the proof to suggest 
changes in the model (e.g. a construction in geometry). 

An interesting recent work that deserves to be mentioned is m- “on Horn 
clause OSHL (the prover written by Plaisted and Zhu) behaves a lot like the 
geometry theorem proving prover of Gelernter et al. which used geometric 
diagrams and backward chaining with Horn clause-like inference rules to guide 
its search.” 

Here we propose to go one step further: instead of using models to discard 
information (or to suggest inferences as in [,'11 )j 1 . we propose to use them a-priori 
for suggesting possible lemmata, in order to structure or to plan proofs. 

Given a set of hypothesis T-L, a goal G (the formula to prove) and a particular 
diagram D (which is a model of T-L), we try to find one (or several) logical 
relations L that are satisfied in D. Then these relations can be seen as lemmata, 
thus suggesting the following (generic) proof plan: 
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— Prove that % models L. 

— Prove that % U {L} implies G. 

This technique may be seen as a way of adding controlled version of the 
(analytic or non-analytic) “cut” rule into the resolution calculus. It is well known 
that clever application of the cut rule frequently amounts to significantly prune 
the proof search and to decrease the size of the obtained proof (see for example 
m- On the other hand, identifying “interesting” lemmata for application of this 
rule is very difficult, since unrestricted application would be obviously impossible 
in practice, due to the huge number of potential candidates. Finding “good” 
lemmata often requires advanced knowledge about the considered domain, and 
is at the very heart of the mathematical activity. 

We consider that the use of diagrams may be a good way to control the 
application of the cut rule, in a realistic and useful way. The following is an 
example of application of this technique. 

Example 8 (Exercise proposed to 16 years old French high school students). We 
consider the following problem. 

“Let {A^B^C) he a triangle. Let M he the midpoint of[A,B], Gi he the mid- 
point of [M,B]. Let G^,G 2 he two points on (A,G) such that CG 2 — G 2 G 3 — 
G 3 A. Let G he a point such that GG — 2/3 x GG\ and let L he the midpoint 
o/[C', M]. Prove that G belongs to (B,G 2 ).” 

It should be clear that the function of models here is different from that men- 
tioned in Section 3.2. In Section 3.2. model building must be been as a way to 
exhibit counterexamples of the initial problem (in refutational approaches for- 
mula are negated) after a failed search. Here, models will be built before the search 
starts and will be used to guide it. Therefore in this case, the pre-generation of 
the Herbrand model is not performed. The system starts by generating auto- 
matically a diagram for this problem (i.e. a model of the hypotheses) using the 
technique presented in Section E3 i-e. the generation of algebraic conditions 
corresponding to a logical axiomatization. 

Then, it tries to generate lemmata by detecting assertions that are true on 
the figure. This is done as follows. 

— First, choose a (finite) set of functional symbols and predicate symbols. Here 
only the elementary predicate symbols parallel and perpendicular and 
the constructor “x,y — >■ (x,y)” (the line between two points x and y) are 
chosen. 

— Then, enumerate all possible atomic assertions (up to a maximal depth) 
that can be constructed on the chosen signature. Assertions that are already 
known or that are equivalent to the considered goal (e.g. (B, G) || (B, G 2 )) 
should be eliminated immediately. 

— Check (using numerical computation) which assertions are true or false in 
the model. 
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Here it generates (among others) the following properties: 

parallel{{B,G 2 ),{M, G 3 )) 
parallel{{G 2 ,I),{M, G 3 )) 
parallel{{B , G), {B, I)) 



Remark 14-- The construction of the diagram and the generation of the lemmata 
can be done automatically in almost negligible computation time on this example 
(HOARD atinf used less than Is). 

The generation of these lemmata suggests the following proof plan: 

1. Try to prove the property G G (B,G 2 ) (i.e. theorem’s conclusion) using all 
available lemmata. 

2. Then, try to prove from B. the lemmata that were actually used in the proof 
generated during Step 1. 

3. Finally reconstruct the proof of G G (H, G 2 ) using the proof built in steps 1 
and 2. 

The first step is actually very easy. It can be done in 4.7 s. 95 clauses are 
generated. The second step is more difficult. The system runs about 50s, and 
generates 1286 clauses (126 are kept, others are deleted by backward or forward 
subsumption). The third step is trivial and can actually be neglected (less than 
Is). 

A rough comparison can be done: proving the initial assertion without using 
the lemmata takes 80s and entails the generation of 2386 clauses. 

5 Related Work 

As already mentioned in Section ^ there are a few logic-based running systems 
for geometry theorem proving. Two important and representative among them 
(an “old” one and a very recent one) are GEOM uni and Geometry Expert 
(GEX) 0. 

GEOM aims at a better better understanding of using Prolog in geometry 
theorem proving. Many of the points investigated in m are the same as those 
investigated in |B| and in the present work. 

The interesting work mu has several limitations with respect to ours, for 
example, GEOM allows to express hypotheses and goals only from a fixed list 
(lines, parallel segments, ...). This makes the system rigid, the user cannot 
specify, e.g. circles. As for diagrams, GEOM uses them both as a filter (as in 
)T5| 1 and also as a guide (i.e. as an example) to propose conclusions. This second 
use is only suggested through a very simple example and is not systematic as 
in HOARD atinp- Of course, our work benefits of the advances of automated 
deduction these last years. 

Our work shares many features with that of Ghou, Gao and Zhang jS| but 
has several important differences. 
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Both systems use a logical approach with forward chaining. Geometry Expert 
is implemented in C, and HOARD atinp is implemented in Prolog. 

We compare both approaches on the basis of their design philosophy and the 
potential capabilities of their underlying methods. 

A comparison from a experimental point of view is part of future work. 

— Axiomatization. The Geometric axiomatization (i.e. the set of geometric 
rules) used by the Geometric Expert is fixed (it is chosen using only geometric 
criteria). 

On the contrary HOARD atinp can be used with different axiomatizations. 
This is needed because the prover has to be used by students with different 
levels of knowledge. 

As explained in Section f2.dl this axiomatization must correspond to the 
set of theorems, definitions and axioms which are available to the students 
in the considered environment. Hence, it cannot be chosen by the prover, 
but must be given to the system (together with the considered geometric 
configuration and conclusion) . The proof must be given using the appropriate 
axiomatization. Indeed, a correct proof may be useless, if it uses definitions, 
properties or constructions that are not yet known by the student. 

More generally, to be convincing, a proof must use theorems, theories, . . . , 
known by the reader or explainable from the reader knowledge. 

The axiomatization in |B| consists in a set of function-free Horn clauses 
(possibly containing existential quantifiers). Our system can handle non- 
Horn clauses which could allow to encode case reasoning into the sys- 
tem (using for example formulas of the form parallel{{A, B),{C, D)) V 
nonparallel{{A,B),{C,D))). Moreover existential quantifiers are replaced 
by function symbols (using skolemization) . 

Very likely this flexibility makes the system usable for domains other than 
geometry with very few modifications (only some particular theories from the 
considered domain could have to be included into the unification algorithm) . 

— Treatment of equality. Geometry Expert does not use the equality predi- 
cate. Equality intervenes as associated to particular predicates (e.g. eqangles) 
and the equal objects are described in extension in the database. In contrast, 
HOARD atinf incorporates an equality axiomatization in an adequate form, 
adapted to the inference process, in particular for handling some geometric 
theories such as parallelism (see Section 2.4). Specific inference rules are nor- 
mally used for reasoning with equality: paramodulation (which corresponds 
to instantiation and replacement of equals by equals) and E-unification (i.e. 
unification modulo a set of equations). 

Dealing explicitely with equality is necessary in teaching the notion of proof. 
It must be handled similarly as human beings do it. Encoding it at the object 
level into geometric axiomatizations would not be really helpful. 

Efficient treatment of the equality predicate in Automated Deduction is well 
known as a difficult problem. Our new calculus (E-hyper-resolution) pro- 
vides useful features for handling it in the context of the considered class of 
problems: E-hyper-resolution prevents the generation of non-ground clauses 
and strongly reduces the search space. 
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— Handling theories. As HOARDatinf, Geometry Expert uses special mech- 
anisms to handle basic geometric properties (commutativity, collinearity, 
transitivity, . . . ) . 

The main idea in jS| is to reflect these properties in the structured database. 
We put them in the inference rule (more precisely in the uniflcation algo- 
rithm). 

Geometry Expert also uses a special mechanism for generating new auxil- 
iary points during proof search. Explicit generation of auxiliary points is 
avoided into our system, since this process is encoded into the considered 
axiomatization, using axioms with constructors, for example: 

(AB) II {CD)\/ inter section{{AB), {CD)) G (AB) 

Using this axiom, the point corresponding to the intersection of (AB) and 
(CD) is added into the set of clauses as soon as (AB) and (CD) are proven 
to be non-parallel. This point is denoted by inter section((AB), (CD)). 

The inference rules in |2| does not use function symbols, hence the use of 
existential quantifiers and explicit skolemization is needed. 

It is worth noticing that the heuristic used by Geometry Expert to guide the 
application of the generation of new auxiliary point rule can be simulated by 
the strategy described in Appendix B, which consists in first saturating the 
set of clauses without considering non-nested clauses, and then adding the 
set of non-nested clauses into the set of saturated clauses. Since non-nested 
clauses (in our terminology) corresponds exactly to existential axioms in 
0, this strategy is essentially equivalent to the one used by the Geometry 
Expert for generating auxiliary points. 

However, since we want to preserve refutational completeness, we do not — 
in contrast to 0 — impose further restrictions on the use of auxiliary points. 
Ghou, Gao and Zhang assume that all the auxiliary points are constructed 
using only non-auxiliary points. In our terminology, this could correspond to 
limiting the depth of terms to 2 (i.e. all terms occurring in the clause set must 
be constructed from constant symbols) . Such additional restrictions could of 
course be easily added into our system (at the cost of loosing refutational 
completeness). 

— Strategy. Whereas Geometry Expert uses a depth-first strategy, the system 
HOARD atinp uses a breath-first one for non-nested clauses. This strategy is 
necessary, since non nested clauses may lead to non-termination. On the 
other hand. Geometry Expert does not use a complete search strategy: it 
uses a depth-first strategy, together with some restrictions on the use of the 
non-nested geometric rules (i.e. the rules used to add new auxiliary points) in 
order to insure termination of the process (i.e. to limit the number of points 
that can be considered) . Since all the other rules correspond to nested clauses 
(since they do not contain any function symbol) this condition is sufficient 
to ensure the termination of the saturation process in 0 (see Appendix B). 

— Computing geometric figures. As in 0, HOARDatinf allows to com- 
pute automatically geometric figures for a given geometric configuration. 
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However, the method in 0 is restricted to very particular geometric con- 
figuration (called linear by the authors) whereas our method can be used 
on any configuration, provided that the corresponding set of algebraic equa- 
tions is solvable. Even if it is not solvable, the generation of additional logical 
consequences of the clause set may help to find a solution. 

In particular, it is easy to see that geometric configurations that are linear 
(in the sense of jH]) corresponds to a set of algebric equations that is already 
in solved form. Therefore, the generation of the model is straightforward in 
this case (no use of any formal system such as Maple, and no generation of 
further properties from the geometric configuration is needed). 

— Using diagrams. In H, diagrams are used only to handle negative condi- 
tions (notice that this could result in logically incorrect “proofs”!). We have 
investigated in the present work the use of diagrams to guide the search for 
a proof, by suggesting proof plans. To the best of our knowledge, no other 
work proposes a systematic use of diagrams to get proof plans. 



6 Conclusion and Future Work 

The reported research has hopefully shown that a general approach to a qualita- 
tive improving of theorem proving is realistic. Obviously, to be usable in practice 
it must be combined with techniques specialized to a particular domain. 

In this work, computer assisted learning of the notion of proof through ge- 
ometry has been chosen and the corresponding specialized techniques have been 
introduced. 

It should be emphasized that our present goal and the one for the near future 
are not to compete with the excellent (algebraic-based) existing geometric the- 
orem provers (benefiting from significant programming efforts for many years) 
but to set theoretical bases in order to incorporate powerful features of human 
reasoning using general techniques. This goal seems to us necessary in some 
uses of automated theorem proving, particularly in assisted learning. The ex- 
periments with our prototype for assisted learning have confirmed this idea. A 
lot of work remains to be done. We mention the main directions of present and 
future research: 

Combining HOARD atinf with other existing systems. We shall 
deepen our comparison with Geometry Expert 0 and evaluate to which 
extent our approaches can be combined to obtain a more powerful system. 
We shall also study the possibility of incorporating some features of 
HOARD atinp to the well-known system Cinderella m- 
— Incorporating symbolic computation systems. Geometric problems of- 
ten require various form of symbolic computation (e.g. dealing with relation 
on lengths or angles) . These kinds of reasoning cannot be efficiently encoded 
in first-order logic. In the present work we have used MAPLE in an ad 
hoc manner. In the future the use of symbolic computation tools (MAPLE, 
MATHEMATIC A, . . . ) should be systematic. 
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— Experimenting with abduction. Abduction, i.e. identifying hypothesis 
allowing to prove a given formula (or to explain a given fact) is an impor- 
tant form of the human inference activity. Abduction is one more application 
of our general approach (see maul). Recently, in m an algebraic-based 
method has been proposed for discovering additional hypotheses in order 
to make true a given statement. We will compare possibilities of both ap- 
proaches. 

— Exploiting full capabilities of our approach to analogy. To handle 
analogy in this work only a very weak version of the algorithm implementing 
our approach 11111 , 1112 ! was necessary. Using the full capabilities of it will 
allow to investigate much deeper geometric properties (see Remark EJ- 

— Using diagrams in logical frameworks. One of the authors have worked 
on geometry modeling in Coq (j2SEZ|) and appreciated the utility of dia- 
grams and the use of analogy (for example in generating lemmata or in proof 
planning) when interactively proven in Coq. Combining ideas and techniques 
in this paper with logical frameworks is an important spin-off of our work, 
presently under investigation. 
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A An (Automatically Generated) Proof for Example 

Remark 15. The justifications provided by the system (e.g. “propriete 2, chap. 
8”, etc.) refer directly to the geometric theorems and definitions given in the 
considered textbook (see Section 

- e est le milieu de [ctk] (hypothese) 

- (ctk) et (ke) sent paralleles, car e est le milieu de [ak] (par definition) 

- f est le milieu de [bk] (hypothese) 

“ (bk) et (kf) sont paralleles, car f est le milieu de [bk] (par definition) 

- k est le milieu de [ab] (hypothese) 

- (ab) et (bk) sont paralleles, car k est le milieu de [ab] (par definition) 

- (ab) et (kf) sont paralleles car (bk) et (kf) sont paralleles, 
et (ab) et (bk) sont paralleles (propriete 1, rappels) 

- (ab) et (cLk) sont paralleles car: k est le milieu de [ab] (par definition) 

- (ab) et (ke) sont paralleles car (ak) et (ke) sont paralleles 

et (ab) et (ak) sont paralleles (propriete 1, rappels) 

- (ke) et (kf) sont paralleles car (ab) et (kf) sont paralleles 
et (ab) et (ke) sont paralleles (propriete 1, rappels) 

- (ke) et (ef) sont paralleles car (ke) et (kf) sont paralleles (par definition) 

- (cLk) et (ef) sont paralleles car (ak) et (ke) sont paralleles 

et (ke) et (ef) sont paralleles (propriete 1, rappels) 

- j est le milieu de [ac] (hypothese) 

- i est le milieu de [be] (hypothese) 

- (ab) et (ij) sont paralleles car j est le milieu de [ac] 

et abc est un triauigle et i est le milieu de [be] (propriete 1, chap. 8) 

- (cLk) et (ij) sont paralleles car: (ab) et (ak) sont paralleles 
et (ab) et (ij) sont paralleles (propriete 1, rappels) 

- (ij) et (ef) sont paralleles car: (ak) et (ef) sont paralleles 
et (ak) et (ij) sont paralleles (propriete 1, rappels) 
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- (ck) et (je) sont paralleles car e est le milieu de [ak] 

et ack est un triangle et j est le milieu de [ac] (propriete 1, chap. 8) 

- (ck) et (if) sont paralleles car f est le milieu de [bk] 

et bck est un triangle et i est le milieu de [be] (propriete 1, chap. 8) 

- (if) et (je) sont paralleles car: (ck) et (je) sont paralleles 
et (ck) et (if) sont paralleles (propriete 1, rappels) 

- fije est un parallelograinme car (ij) et (ef) sont paralleles 
et (if) et (je) sont paralleles (definition 14, rappels) 

- abc est isocele en c (hypothese) 

- (ck) est la medians issue de c du triangle cab car k est le milieu de [ab] 
(definition 10, rappels) 

- (ck) est la hauteur issue de c du triangle cab car abc est isocele en c 
et abc est un triauigle et (ck) est la medicine issue de c du tricingle cab 
(propriete 5, chap. 10) 

- (ab) et (ck) sont perpendiculaires car (ck) est la hauteur issue de c du 
tricingle cab (definition 10, rappels) 

- (ck) et (ij) sont perpendiculaires car (ab) et (ij) sont paralleles et 
(ab) et (ck) sont perpendiculaires (propriete 3, rappels) 

- (ij) et (if) sont perpendiculaires car (ck) et (if) sont paralleles 
et (ck) et (ij) sont perpendiculaires (propriete 3, rappels) 

- fije est un rectangle car: fije est un parallelograinme et (ij) et (if) sont 
perpendiculaires (propriete 30, rappels) 
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- h est le milieu de [ad] (hypothese) 

- g est le milieu de [cd] (hypothese) 

- (ac) et (gh) sont paralleles car h est le milieu de [ad] , acd est un tricingle 
et g est le milieu de [cd] (propriete 1, chap. 8) 

- f est le milieu de [be] (hypothese) 

- e est le milieu de [ab] (hypothese) 

- (ac) et (ef) sont paralleles car f est le milieu de [be] , abc est un tricingle 
et e est le milieu de [ab] (propriete 1, chap. 8) 

“ (gh) et (ef) sont paralleles car (ac) et (gh) sont paralleles 
et (ac) et (ef) sont paralleles (propriete 1, rappels) 

“ (bd) et (gf) sont paralleles car g est le milieu de [cd] , bed est un tricingle 
et f est le milieu de [be] (propriete 1, chap. 8) 

- (bd) et (he) sont paralleles car h est le milieu de [ad] , abd est un tricingle 
et e est le milieu de [ab] (propriete 1, chap. 8) 

- (gf) et (he) sont paralleles car (bd) et (gf) sont paralleles 
et (bd) et (he) sont paralleles (propriete 1, rappels) 

- fghe est un parallelograinme car (gh) et (ef) sont paralleles, ghef est un 
quadrilataire et (gf) et (he) sont paralleles (definition 14, rappels) 

- (ac) et (bd) sont perpendiculaires (hypothese) 

- (ac) et (he) sont perpendiculaires car (ac) et (bd) sont perpendiculaires 
et (bd) et (he) sont paralleles (propriete 3, rappels) 

- (he) et (ef) sont perpendiculaires car: (ac) et (ef) sont paralleles, (ac) 
et (he) sont perpendiculaires (propriete 3, rappels) 

- fghe est un rectangle car fghe est un parallelograinme et (he) et (ef) sont 
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perpendiculaires (propriete 30, rappels) 

- longueur(segment(a,c)) = longueur (segment (b,d)) (hypothese) 

- longueur(segment(g,h)) = longueur(segment(a,c))*0.5 car h est le milieu de 
[ad], acd est un triangle et g est le milieu de [cd] (propriete 1, chap. 8) 

“ longueur (segment (g,f)) = longueur (segment (b,d)) *0.5 car g est le milieu de 
[cd] , bed est un triangle et f est le milieu de [be] (propriete 1, chap. 8) 

- longueur(segment(g,h)) = longueur (segment (g,f)) car 
longueur (segment (a, c)) = longueur (segment (b,d) ) , 
longueur (segment (g ,h) ) = longueur (segment (a, c) ) *0 . 5 

et longueur(segment(g,f)) = longueur ( segment (b,d)) *0.5 (par definition) 

- longueur (segment (e,f)) = longueur(segment(a,c))*0.5 car f est le milieu de [be] 
abc est un tricuigle et e est le milieu de [ab] (propriete 1, chap. 8) 

- longueur(segment(g,f )) = longueur (segment (e,f)) car 
longueur (segment (a, c)) = longueur (segment (b,d) ) , 
longueur (segment (g , f ) ) = longueur (segment (b , d) ) *0 . 5 

et longueur(segment(e,f)) = longueur (segment (a, c) ) *0 . 5 (par definition) 

- longueur(segment(h,e)) = longueur (segment (b,d)) *0.5 car h est le milieu de [ad] 
abd est un tricuigle et e est le milieu de [ab] (propriete 1, chap. 8) 

- longueur(segment(g,f )) = longueur(segment(h,e)) car 
longueur (segment (g , f ) ) = longueur (segment (b , d) ) *0 . 5 

et longueur(segment(h,e)) = longueur ( segment (b,d)) *0.5 (par definition) 

- fghe est un losange car longueur(segment(g,h)) = longueur(segment(g,f)) , 
longueur(segment(g,f )) = longueur(segment(e,f)) , ghef est un quadrilataire 
et longueur(segment(g,f)) = longueur(segment (h,e) ) (definition 15, rappels) 

- fghe est un carre car fghe est un losange et fghe est un rectcuigle 
(propriete 32, rappels) 
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Abstract. We propose the basis of a higher-order logical framework to 
axiomatize and build proofs in Hilbert’s elementary geometry in which 
intuitionistic aspects are emphasized. More precisely, we use the Calcu- 
lus of inductive constructions and the system Coq to specify geometric 
concepts and to study and interactively handle proofs for the first two 
groups of Hilbert’s axiomatics. It is the first step to a formalization well 
adapted to the definition of primitive operations that are used in many 
different geometric algorithms. 



1 Introduction 

We study a formalization of geometry well adapted to computer-related aspects 
of geometry, including computational geometry, geometric modelling, geometric 
construction and robotics. In this paper, we propose the basis of a higher-order 
logical framework to axiomatize and build proofs in Hilbert’s elementary geom- 
etry in which intuitionistic aspects are emphasized. 

The interest of an axiomatization of geometry for computer science has been 
well demonstrated by several authors, for instance Toussaint m and Knuth 
la. in order to better understand geometric universes and well define primi- 
tive operations that are used in many different algorithms. Our research group 
has reached the same conclusion for topology-based geometric modelling 
Moreover, in spite of the impressive results of the mechanical geometry theorem 
proving based on polynomial algebra I23EI, the crucial issue of round-off errors 
often makes logical reasoning in pure geometry preferred to computing in ana- 
lytic geometry when possible. This is the case for knowledge-based systems that 
symbolically solve geometric constraints, for instance in CAD |0|. 

In computer science, the need to actually build and manipulate geometric 
objects favors an intuitionistic approach, proscribing the excluded middle axiom 
or equivalent ones HH. This point of view has been defended by Heyting H21 
and developped by von Plato in a new approach of affine concepts. 

But instead of following a not much frequented way to address geometry (see 
for instance for various propositions related with this topic), we opted to 
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attack a commonly accepted geometry, hence our choice of Hilbert’s axiomatics. 
Thus, following the progression of Hilbert’s Grundlagen der Geometric , we 
have tried to express the beginning of this axiomatics and the proofs of funda- 
mental theorems in the chosen logical framework. When it was not natural for us, 
we have adapted Hilbert’s formulation, especially by distinguishing intuitionistic 
and classical aspects. 

More precisely, our framework is the Calculus of inductive contructions (CIC) 
which is based on type theory and offers a high abstract generic level of speci- 
fication and proof. It is implemented in system Coq developped at the INRIA 
and already used by von Plato for geometry and our research group for discrete 
topology 

The paper is structured as follows. Section 2 shortly presents the calculus 
of inductive constructions and Hilbert’s geometry with its five axiom groups. 
Section 3 gives some basic formal definitions of geometric types and relations in 
Coq. Section 4 specifies the axioms of the first Hilbert’s group. Sections 5 and 6 
each present a theorem of the first group. Section 7 does the same for the axioms 
and two theorems of the second group. Then, Section 8 develops the proof of a 
larger theorem which requires additional data types, and Section 9 gives some 
conclusions. Rather than giving a complete view of the results, we will focus on 
the novelties that come up at each step. 

2 Basic Notions 

2.1 Calculus of Inductive Constructions 

Type theories are logical frameworks the objects of which are typed lambda- 
terms. They can be distinguished by the power of their typing system and their 
underlying logic (see for instance uni). Examples are Church’s higher order logic, 
Martin-L6f intuitionistic type theory and the Calculus of inductive constructions 
(CIC) 0. A typed lambda-term formalism is related with its logical semantics 
by the Curry-Howard isomorphism, which states that: 

— any proof p is represented by a lambda-term P 

— any proposition s is represented by a type S 

— pis a, proof of s iff P is of type S 

The typing system of CIC is based on Girard’s polymorphic lambda-calculus, 
a very powerful typing system that allows the representation of proofs as well 
as types by lambda-terms, the underlying logic being intuitionistic. Another im- 
portant feature of CIC is the possibility to inductively define types that are used 
to define usual logical constructions with connectors and quantifiers as well as 
user application-oriented data types and predicates. Fundamentally, the generic 
inference rules, or type-checking rules, of CIC allow introduction or elimination 
of the universal quantifier, of the lambda-abstraction or of inductive constructs. 
All usual quantifiers and connectors manipulation rules can be considered to be 
derived from them. 
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To develop our specifications and proofs we have worked with system Coq, 
a CIC-based conversational proof assistant. The user communicates with the 
system through a high-level language called Gallina that allows the definition 
of axioms, parameters, types, functions and predicates. Coq also supports in- 
teractive proof constructions in which reasoning steps are specified by the user 
through commands called tactics, which in fact implement the CIC inference 
rules. This point will be illustrated in the subsequent geometric proofs. 

2.2 Hilbert’s Axiomatics 

In the late XIXth century, geometry is still based on Euclid’s theory, which 
itself is based on intuition. The creation of non-euclidian geometries leads a few 
mathematicians to go back to the roots of geometry and find alternate or better 
ways to found it. In 1899, David Hilbert publishes Grundlagen der Geometrie 
C3 in which he proposes a new axiomatics for elementary geometry. Hilbert 
pays special attention to issues such as axiom classification, independence and 
minimality. He is also the first one to abstractly link arithmetics and geometry 
by creating a linear and area calculus system. Hilbert classifies his axioms in five 
groups: 

I. Incidence axioms 
II. Order axioms 
HI. Congruence axioms 
IV. Parallel axiom 
V. Continuity axioms 

We have only studied group I and a significant part of group II so far. Each 
group features a number of important theorems in addition to the axioms, the 
proof of many of which is not given. Hilbert’s style is informal, therefore many 
formalization problems arise. We will focus on the methodology we used to over- 
come these problems. Besides, Hilbert works in a classical logical framework, 
whereas we try to be constructive. Therefore, we are especially concerned about 
the use of excluded middle axiom or its equivalents. 

3 Definitions 

Group I of Hilbert’s axioms deals with incidence. They express relations between 
points, lines and planes, these are abstract beings at the very beginning. Thus, 
in Gallina, they may be seen as belonging to one of three types designated by 
parameters declared by: 

Parameters point, line, plaine : Set. 

In Gallina, t : T reads “t is of type T” , and Set, which has nothing to do with set 
theory, is used to refer to “the type of concrete types”, i.e. the type of the type 
of the objects that are actually built and studied. This command adds point. 
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line and plane as three new constants of type Set in the environment. It can 
be understood as the axiom “there exist three objects of type Set named point, 
line and plane”. Hilbert then considers two abstract binary incidence relations, 
one between points and lines, the other between points and planes. They are also 
declared through parameters: 

Parameter inc-pt-ln: point -> line -> Prop. 

Parameter inc-pt-pl: point -> plane -> Prop. 

In these functional notations, Prop is the built-in type of logical propositions. 
Group II of ax;ioms deals with a ternary betweenness relation between points. 
As before, it is declared through a parameter: 

Parameter between: point -> point -> point -> Prop. 

Thus, (between ABC) stands for “B is between A and H”. Finally, it is con- 
venient to define some other relations built from those parameters, using the 
Gallina definition mechanism. For instance, alignment of three points is defined 
by: 



Definition alignedS: point -> point -> point -> Prop:= 

[A,B,C: point] 

(EX 1: line 

I (inc-pt-ln A 1) /\ (inc-pt-ln B 1) 

/\ (inc-pt-ln Cl)). 

This definition abbreviates the term [A,B,C: point] (...) into the symbol 
alignedS of type point Prop. The square-bracketed expression 

[A,B,C: point] represents the lambda-abstraction of variables A,B and C, and 
the expression (EX 1: line | ...) represents the existential quantification over 
variable 1. Thus, the whole term intuititively means “given three points A,B et C, 
there exists a line 1 such that A,B et C are incident to 1”. Likewise, we represent 
incidence between a line and a plane with the following definition: 

Definition inc-ln-pl: line -> plane -> Prop:= 

[a: line ;a’: plauie] 

(A: point) 

(inc-pt-ln A a) -> (inc-pt-pl A a’). 

where the notation (A: point) (...) represents the universal quantification of 
variable A. The whole definition means “given a line a and a plane a’, for any 
point A, if A is incident to a, then it is incident to a ’ ” . Many other definitions are 
built in a similar way. Note that, adapting Hilbert’s notation, we name points, 
lines and planes with capital, minuscules and quoted minuscules, respectively. 
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4 Axioms of the First Group 

We can then use these variables and definitions to give a naive Gallina translation 
of the first group of Hilbert’s axioms. The first of them is written by Hilbert: 

Axiom “For any given two points A and B, there is a line I to 

which both A and B are incident.” 

According to this formulation, A and B may be identical. However, when he 
refers to several beings of the same kind, Hilbert always assumes that they are 
all distinct. This will prove important to use axiom (1,1) constructively. Thus, 
when translating these axioms into Gallina, we systematically mention that all 
variables of the same kind are distinct. For instance, a Gallina version of axiom 
(1,1) in which A and B are supposed to be distinct is the following: 

Axiom AI-1 : 

(A,B: point) 

~A=B -> (EX a: line I 

(inc-pt-ln A a) /\ (inc-pt-ln B a) ) . 

This term can be understood as: “for any two points A and B, if A and B are not 
equal, then there exists a line a, such that A is incident to a and B is incident to 
a” . In a similar way, Hilbert writes the second axiom: 

Axiom (1,2): “There is no more than one line to which two given points 
A and B are incident.” 

Note that A and B are here obviously meant to be distinct, as this state- 
ment would otherwise be false in “natural” geometry, which Hilbert’s axioms 
are supposed to formalize. This axiom is much harder to express formally. In- 
deed, Hilbert’s formulation actually means “the number of lines to which two 
given points A and B are incident is no more than one” . Should we try to trans- 
late this statement directly, we would have in some way to formally describe 
the counting of lines satisfying a proposition. This is rather cumbersome as it 
would probably require the introduction of concepts such as naturals, infinite 
sets and cardinals, especially if performed in a constructive framework. But the 
main problem is that we have no way to make sure that our method of counting 
lines will yield the same results as Hilbert’s, as he did not give any properties 
of this process. The best we can do is saying that line counting is a rather easy 
and clear concept, and that Hilbert probably meant the same thing as we do. 
However, we choose a formal presentation of this axiom which is syntactically 
far from a direct translation, but semantically acceptable: 

Axiom AI-2: 

(A,B: point) (a, b: line) 

~A=B -> (inc-pt-ln A a) -> (inc-pt-ln B a) 

-> (inc-pt-ln A b) -> (inc-pt-ln B b) 

-> a=b . 
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which can be read as: “For any two given points A and B and any two given 
lines a and b, if A and B are distinct, if A and B are incident to a, and if A and 
B are incident to b, then a and b are identical”. Although other formulations 
are equivalent in classical logic, ours is rather powerful in intuitionistic logic, 
mainly because it yields a positive result El, which had already been used by 
m- Similar considerations that we cannot develop any further can be done on 
the six remaining axioms of the first group. 



5 Theorems of the First Group 

This section discusses the proof of Theorem I, which is given by Hilbert in his 
work. For modularity reasons, we have split it into four parts, numbered la to 
Id. Thus, Theorem la informally writes: 

Theorem la: “Two lines have at most one common point.” 

which can be translated in Gallina into: 

Theorem la: 

(a,b: line) (A, B: point) 

~a=b -> (inc-pt-ln A a) ->(inc-pt-ln B a) 

-> (inc-pt-ln A b) -> (inc-pt-ln B b) 

-> A=B. 

where counting is handled in a way similar to axiom (1,2). Thus Theorem la 
can read: “all points common to two distinct lines are equal” . To illustrate the 
basic mechanisms of the Coq system, we detail the proof of this theorem. In 
fact, the proof requires the addition of an axiom of excluded middle, or at least 
of decidability of point equality. But we can start by constructively proving a 
weaker version of this theorem: 

Theorem la-weai: 

(a,b: line) (A, B: point) 

~a=b -> (inc-pt-ln A a) -> (inc-pt-ln B a) 

-> (inc-pt-ln A b) -> (inc-pt-ln B b) 

~"A=B. 

which can read: “any points common to two distinct lines cannot be proved 
different” . 

Proof. — Tactic Unfold 1 not unfolds the first occurence of symbol ~, thus 
expanding the subterm ~~A=B into ~A=B False. 

— Tactic Intros introduces (a,b: line), (A,B: point), ~a=b, (inc-pt- 
ln A a) , (inc-pt-ln B a) , (inc-pt-ln A b) , (inc-pt-ln B b) , A=B 
as hypotheses, the only conclusion being False. 

— Tactic Absurd a=b allows us to prove any goal, in particular False, provided 
we prove ~a=b and a=b. 
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— Tactic Assumption proves ~a=b which is among the hypotheses. 

— Tactic Apply AI_2 with A:=A B:=B applies axiom AI_2 with convenient ef- 
fective parameters, and leads to five subgoals, which are all among hypothe- 
ses. 

— Finally, tactic Assumption used five times removes all of them. 

□ 

Then, the complete proof of Theorem la is easily obtained from the Theorem 
la-weak and the excluded middle axiom: 

Axiom EM: (p: Prop) p \/ ~p. 

or, what is equivalent here, the axiom of decidability of point equality: 

Axiom DEC_EQ_PT (A,B: point) A=B \/ ~A=B. 

The use of these axioms is natural in classical logic, but is prohibited in intu- 
itionistic logic. In fact, the lack of ways to infer a point equality with our axioms 
suggests that Theorem la cannot be proved constructively. Thus if we assume 
DEC_EQ_PT, a proof of Theorem la is: 

Proof. — As above. Tactic Intros introduces all premises into the local con- 
text. 

— Tactic (Elim DEC_EQ_PT A B) generates two subgoals: 

• A=B— ^A=B: removed by Tactic Trivial. 

• ~A=B— ^A=B: Tactic Intro introduces ~A=B into the local context. 

• Tactic Absurd ~A=B replaces the current goal by two subgoals ~A=B and 
~~A=B. 

* Tactic Assumption removes the first subgoal. 

* Tactic Apply la-weak with a:=a b:=b applies Theorem la-weak, 
and generates one subgoal for each premise of la-weak. 

* Finally, Tactic Assumption used five times removes the remaining 
subgoals. 

□ 



The second part of Hilbert’s Theorem I expresses: 

Theorem Ib: “Two distinct planes either have no common point or 
have a common line.” 

Its naive Gallina translation is: 

Theorem Ib: 

(a’ ,b ’ ; plane) 

~a’=b’ 

-> ~(EX A: point I (inc-pt-pl A a’) 

/\ (inc-pt-pl A b’)) 

I (inc-ln-pl a a’) 

/\ (inc-ln-pl a b’)). 



\/ (EX a: line 
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As previously, we can prove this theorem using the excluded middle or an equiv- 
alent. Here again, we conjecture that it cannot be proved constructively. This 
time, the reason is that it is a classification theorem, and that nothing in our 
axioms allows reasoning by cases. This anomaly is fixed by adding the excluded 
middle to the axioms. As in Theorem la, we can constructively prove for this 
theorem a weaker version, with which the excluded middle directly yields the 
full theorem. The third part of Hilbert’s Theorem I writes: 

Theorem Ic: “Two distinct planes with a common line have no common 
point outside this line.” 

There are several ways to formalize this theorem, for instance: 

Theorem Ic-neg: 

(a’,b’: plane) (a: line) (A: point) 

~a’=b’ 

-> (inc-pt-pl A a’) -> (inc-pt-pl A b’) 

-> (inc-ln-pl a a’) -> (inc-ln-pl a b’) 

-> ~~(inc-pt-ln A a). 

which expresses the fact that the planes a ’ and b ’ in play have no common 
point outside the common line a, or: 

Theorem Ic-pos: 

(a’,b’: plane) (a: line) (A: point) 

~a’=b’ 

-> (inc-pt-pl A a’) -> (inc-pt-pl A b’) 

-> (inc-ln-pl a a’) -> (inc-ln-pl a b’) 

-> (inc-pt-ln A a) . 

which expresses the fact that all points common to planes a ’ and b ’ are incident 
to the common line a. 

We have proved Ic-neg constructively, and Ic-pos classically with the ex- 
cluded middle. Once again, we conjecture that Ic-pos cannot be proved con- 
structively. Finally, the fourth part of Theorem I writes: 

Theorem Id: “A plane and a line that is not incident to this plane have 
at most one common point.” 

No new kind of problem arises in the treatment of this theorem. 

6 Theorem II 

In Group I of Hilbert’s axiomatics. Theorem II deals with existence and unicity 
of the plane incident to a line and a point, or to two lines: 

Theorem II: “There is a single plane incident to two given distinct 
convergent lines, or to a given line and point apart from this line.” 
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Like Theorem I, Theorem II is split into four parts, or “subtheorems” . The last 
two parts, which address unicity of the considered planes, can be formalized 
easily and proved constructively. The first two parts, which address existence of 
the planes, are slightly harder to formalize, and their proofs use the decidability 
of point equality. 

Here, decidability occurs in a new way. Indeed, decidability was previously 
used at the very beginning of the proof to produce two cases, one being proved 
constructively, and the other being proved by refutation. Thus Theorems la to 
Id can be seen as direct classical corollaries of constructive lemmas. In the first 
two parts of Theorem II, decidability applies to points built in the middle of the 
proofs, and is used to decide which of two constructions must be chosen. Here 
constructive parts and classical decision parts are interwoven in a more complex 
way than in Theorem I, thus non-constructivity must be handled in a different 
manner. A method would be to prove a lemma for each constructive part of 
the complete classical proof. The problem is that those lemmas are not very 
meaningful by themselves. Our method is to weaken the two existence-related 
subtheorems by adding an hypothesis, which will be used during the proof to 
make the decision that was previously made by the use of decidability in the 
classical proof. This extra hypothesis, called can-build-other-point-on-line 
is a much weaker and more specialized version of point equality decidability: 

Def inition Ccui-build-other-point-on-line : = 

(A: point) (a: line) 

(inc-pt-ln A a) 

-> (EX B: point I ~A=B /\ (inc-pt-ln B a) ) . 

which states that, given a line and a point on this line, one can build another 
point on this line. Using this definition we can express and constructively prove 
the following theorem: 

Theorem Ilexll-weak: 

CcUi-build-other-point-on-line -> 

(a,b: line) (A: point) 

~a=b 

-> (inc-pt-ln A a) -> (inc-pt-ln A b) 

-> (EX a’ : plane I 

(inc-ln-pl a a’) /\ (inc-ln-pl b a’)). 

which states that, assuming can-build-other-point-on-line, one can build 
a plane incident to two distinct given lines that have a common point. Once 
again, we have managed to express the full theorem as a classical corollary of this 
constructive theorem, as can-build-other-point-on-line is a consequence of 
point equality decidability. However, this weak constructive version is actually 
much weaker, as can-build-other-point-on-line must be assumed whenever 
we want to use it. This extra hypothesis can also be understood as a property 
we wish we had in our axiomatics, and could be used as a hint as to how to 
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modify Hilbert’s classical axiomatic in order to build a constructive one. The 
other existence subtheorem is handled in the same way. 

All theorems of the first group have been formally proved in classical logic, 
and almost none were proved in constructive logic. This strongly suggests that 
Hilbert’s axioms are not naturally well adapted for constructive reasoning, and 
need to be tweaked in order to fill our needs. 

7 Axioms and Theorems of the Second Gronp 

7.1 Axioms of the Second Group 

The four axioms of Group II of Hilbert’s geometry describe relation between 
(see Sect. 3). They are straightforwardly translated to Gallina, the first one 
being split into two parts numbered a and b: 

Axiom (11,1) part a: “If B is between A and C, then A, B, C belong 
to a line.” 

translated in Gallina into: 

Axiom All-la: 

(A,B,C: point) 

~A=B -> ~A=C -> ~B=C -> (between ABC) 

-> (alignedS ABC). 

Note that as usual we only consider distinct points. 

Axiom (11,1) part b: “If B is between A and G, then B is also between 
C and A.” 

translated into: 

Axiom All-lb: 

(A,B,C; point) 

~A=B -> ~A=C -> ~B=C -> (between ABC) 

-> (between C B A) . 

Axiom (11,2): “Given two points A and G, there exists at least one 
point B belonging to line AC such that G is between A and H.” 

Thanks to Axiom All-la, explicitely stating that A, B and G are aligned is 
redundant: 

Axiom AII-2: 

(A,B: point) 

~A=B -> (EX C: point I 

~A=C /\ ~B=C /\ (between A B O). 
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Another option was to give the name “line AF” to the line built between X and 
Y using axiom AI-1. 

Axiom (11,3): “Among three points on a line, there is no more than 
one that is between the other two.” 

translated into: 

Axiom AII-3: 

(A,B,C: point) 

~A=B -> ~A=C -> ~B=C -> (between ABC) 

-> "(between BAG). 

Propositions "(between CAB), "(between A C B) and "(between B C A) 
can then be deduced using AI-1 and All-lb. 

Axiom (11,4) (or axiom of Pasch): “Let A, B and C be three un- 
aligned points and a a line of plane ABC that meets none oi A, B and 
C\ if line a meets any point of segment AB, then it meets either a point 
of segment BC or a point of segment AC.” 

To make things shorter, we first introduce a predicate which states that a given 
line meets a point of a given segment represented by its ends: 

Definition line-meets-seg:= 

[a: line] [A, B: point] 

(EX C; point I "A=C /\ "B=C 

/\ (inc-pt-ln C a) /\ (between A C B)). 

Now we can write Axiom (11,4): 

Axiom AII-4: 

(A,B,C; point; a: line) 

"(alignedS ABC) 

-> "(inc-pt-ln A a) 

-> "(inc-pt-ln B a) -> "(inc-pt-ln C a) 

-> (line-meets-seg a A B) 

-> (line-meets-seg a A C) \/ (line-meets-seg a B C) . 

This axiom is central in Hilbert’s axiomatic, as it spawns the most interesting 
and meaningful consequences. In our axiomatic, it is also the first axiom that 
allows to infer a disjunction. 

7.2 Theorems of the Second Group 

Hilbert proposes a number of theorems and gives nonconstructive proofs for 
them. As in Theorem Ilexll-weak, we obtain constructive analog theorems by 
adding weaker versions of excluded middle to the premises of these theorems. 
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Theorem III: “Given two points A and C, there is at least one point 
D on the line AC between A and C.” 

In Gallina, this classical theorem writes: 

Theorem Till: 

(A,C: point) 

~A=C 

-> (EX D: point I ~A=D /\ ~C=D 

/\ (between A D C)). 

The only purely classical step in Hilbert’s proof is the construction of a point E 
not aligned with A and C, which cannot be done constructively without an extra 
assumption. Such an assumption is that, given two distinct points, one is able 
to build a third point such that the three points are not aligned: 

Definition can-build-unaligned-point : Prop:= 

(A,B: point) ~A=B -> (EX C: point I ~(aligned3 A B C)). 

Thus, adding this definition to the premises of Theorem III makes it construc- 
tively provable. This hypothesis is natural enough to consider that we do not go 
too far away from the spirit of Hilbert’s axiomatics. Theorem IV is handled in 
a similar way: 

Theorem IV: “Of three aligned points A, B and C, there is one that 
is between the two others.” 

This classical theorem writes: 

Theorem TIV : 

(A,B,C: point) 

~A=B -> ~A=C -> ~B=C -> (alignedS ABC) 

-> (between BAG) \/ (between A C B) \/ (between ABC). 

It is easily proved in classical logic, and requires the addition of two hypotheses 
to be proved in constructive logic. The first one is that all intersection points 
of distinct lines are equal, the second one is that the relation between is decid- 
able. They are respectively named eq-line-inter and dec-between. Finally, 
Theorem V writes: 

Theorem V : “Given four points of a line, they can be designated by A, 

B, C and D so that B is between A and C and between A and D, and 
that C is between A and D and between B and D.” 



This theorem has not been formally proved as it is a special case of Theorem VI, 
which is proved further. However, Hilbert’s proof uses two interesting lemmas 
which can be seen as transitivity properties of between: 
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Lemma V-1: “If B belongs to segment AC and C belongs to segment 
BD, then B and C belong to segment All.” 

Lemma V-2: “If B belongs to segment AC and C belongs to segment 
AD, then C belongs to segment BD and B belongs to segment AH.” 

Both of the lemmas can be proved classically, but also constructively when 
adding the previously defined hypotheses can-build-not-aligned-point and 
eq-line-inter. They will be crucial in the proof of Theorem VI, as they pro- 
vide a rather powerful manipulation tool for relation between that is much more 
intuitive than Pasch’s axiom. 

8 Theorem VI 

Theorem VI is an example of complex result that requires the introduction of 
new proof techniques. It writes: 

Theorem VI: “Given a finite number of points of a line; it is possible 
to designate them by A, B, C, D, E, K so that B is between A 
and every other point, that C is between A, B on the one hand and D, 

E, . . . , K on the other hand, that D is between A, B, C on the one 
hand and D, E, K on the other hand, and so on. Only the opposite 
denomination K, . . . , E, D, C, B, A shares the same properties.” 

In his book Hilbert does not give any proof for this theorem. Except for 
the unicity part, we have proved it semi-constructively with the help of Coq in 
the style of Theorems III to V. The formulation of this theorem is much trickier 
than in the previous theorems for two reasons. The first reason is that Theorem 
VI applies to a variable amount of points, which requires quantification on point 
sets rather than on individual points. The second reason is that we need to 
formalize the notion of point designation. We choose to represent a point set by 
a linear list of points, the designation of a point in this set being assimilated to 
its rank in the corresponding list. The theorem then translates into: 

Theorem VI (reformulated): “Any list I of distinct points of a line 
can be sorted into a list I' , such that each point in I' is between all of its 
predecessors in I' and all of its successors in 

In order to formally prove this theorem, we must axiomatize the notion of list 
and perform an induction on the list structure. 

8.1 Axiomatization of Lists 

We can immediately use Coq’s built-in lists, the inductive polymorphic type of 
which is defined by: 

Inductive list [A: Set]: Set:= 
nil: (list A) 

I cons: A -> (list A) -> (list A). 
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Thus list is an inductive type parametered by a set A and generated by the two 
constructors nil and cons. The standard Coq library provides some functions to 
manipulate or observe lists, such as In which tests whether an element belongs 
to a list, length that computes the length of a list, and rev that reverses a 
list. With these we define several auxiliary predicates: (external-point A 1) 
which states that, on a line, a given point A is not between any two elements of 
the point list 1 (note that A may belong to l); (distinct 1) which states that 
the points of a list 1 are all distinct; (permutation 1 1’) which states that 1 
is a permutation of 1 ’ ; (sorted 1) which states that 1 is sorted with respect to 
between as described above; (aligned 1) which states that all points of 1 are 
aligned. Using all these notions Theorem VI formally writes: 

Theorem TVl : 

dec-between -> 

can-build-unaligned-point -> eq-line-inter 
-> (1: (list point)) 

(aligned 1) -> (distinct 1) 

-> (EX 1’: (list point) I 

(permutation 1 1’) 

/\ (sorted 1’) /\ (distinct 1’)). 

The proof of this theorem uses Lemma EXISTS-EXTERNAL-PDINT, which finds in 
a point list a point that is external to this list. It is presented in the following 
section. 



8.2 Proof of EXISTS-EXTERNAL-POINT 

This lemma formally writes: 

Lemma EXISTS-EXTERNAL-POINT : 
dec-between 

-> can-build-unaligned-point -> eq-line-inter -> 

(1: (list point)) 

(aligned 1) -> ~l=(nil point) 

-> (distinct 1) 

-> (EX A: point I (In A 1) 

/\ (external-point A 1)). 

Proof. The Coq system is asked to perform induction on the structure of 1 
through Tactic Induction. There are two cases: 

— l=(nil point) : a contradiction is inferred with hypothesis 1= (nil point). 

— l=(cons N 1 ’): a second induction is performed on 1 ’, with anew two cases: 

• l’ = (nil point): N is easily proved to be a suitable external point to 1. 

• l’ = (cons HI 1’ ’): a third induction is performed on 1’ ’, with anew 
two cases: 
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* 1’ ’ = (nil point): N is again easily proved to be a suitable external 
point to 1. 

* 1’ ’ = (cons H2 1’ ’ we are in the general case, which is handled 
as follows. 

The general case is solved using as induction hypothesis the fact that EXISTS- 
EXTERNAL-POINT is satisfied for the list l’ = (cons HI (cons H2 1’’’)). This 
hypothesis is used to build a suitable point X external to 1 ’ that also belongs to 
lb 

We then show that either HI or H2 is distinct from X as all points in 
1 are distinct; that point is named H. We then use the constructive version 
of Theorem IV (Sect. 7.2) to infer (between N X H) V (between X H N) V 
(between X N H) . We have to study the three cases: 

~ (between N X H) : we want to prove that N is a point external to 1 that 
belongs to 1. 

~ (between X H N): we want to prove that X is a point external to 1 that 
belongs to 1. 

— (between X N H) : we want to prove that X is a point external to 1 that 
belongs to 1. 

In order to take care of the first case, we must show that (between N B C) V 
(between N C B) for any two points B and C in 1. To do so, we compare each of 
B and C to each of N, X and H, and thus create 2® subgoals, each corresponding 
to a combination of comparison results. The comparison is possible without 
assuming point equality decidability thanks to the assumption that all points in 
1 are distinct. Among these subgoals, 57 lead to a contradiction and are solved 
automatically using tactic Auto, and the remaining 7 show strong similarities. 
Thus they are all solved with two local lemmas that are proved using Lemmas 
V-1, V-2 and Axiom All-lb, and that take advantage of those similarities. We 
then have to prove that N belongs to 1, which we do by using the definition of 
1. The proofs for the second and third cases have the same structure as the one 
for the first case, except that N is replaced by X and that there is only one local 
lemma. 

This ends the proof by induction on the list structure that EXISTS-EXTERNAL- 
POINT is satisfied for any list. □ 

Now we can prove Theorem VI itself. 

8.3 Proof of Theorem VI 

The proof is led by induction on the length of the entry point list 1. 

Proof. System Coq is asked to perform Tactic Induction on (length I), i.e. 
on the structure of the naturals. There are two cases: 

— (length I)=D: obviously (nil point) is a sorted permutation of 1. 
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— (length 1) = (S n) : a second induction is performed on n, with anew two 
cases: 

• n=0: list 1 has a single element, and we immediately prove that 1 is a 
sorted permutation of itself. 

• n=(S n’): a third induction is performed on n’, with anew two cases: 

* n’=0: list 1 has two elements, and we easily prove that 1 is a sorted 
permutation of itself. 

* n’ = (S n’ ’ ): list 1 has three or more elements. We are in the general 
case, which is handled as follows. 

In the general case, we apply EXISTS-EXTERNAL-POINT to extract from 1 a point 
H that is external to 1. We then apply the induction hypothesis to the remaining 
list 1 ’ to produce 1 ’ ’ , a sorted permutation of 1 ’ . The first and last element of 
1 ’ ’ are respectively denoted by F and L. As H is a point external to 1 ’ (and thus 
to 1’’), we have by definition (between H F L) V (between H L F). Hence 
two cases are to be studied: 

— (between H F L): we prove that (cons H 1’ ’) is a sorted permutation of 
1 . 

— (between H L F): we prove that (cons H (rev 1’ ’)) is a sorted permu- 
tation of 1. 

In both cases, most of the proof is performed using a common lemma that in 
turn applies Lemma V-2. □ 

In fact, the complete proof requires lots of tedious auxiliary operations on lists 
and proofs of their properties. 

9 Conclusion 

In this work we have studied the first two groups of Hilbert’s axiomatics from a 
constructive point of view. In order to build a constructive specification, several 
variants for the same axioms and theorems have been proposed and compared. 
However, obtaining all of Hilbert results absolutely requires some purely classical 
features, such as decidability of some properties. 

The specification itself was developed in the Gallina language of powerful 
system Coq. Generic definitions have been written, and numerous lemmas and 
theorems have been interactively formally proved using techniques such as in- 
duction. The proof scripts amounted to roughly 120 pages, about half of which 
for Theorem VI and its auxiliary types and lemmas. 

This work only dealt with a very small part of Hilbert’s book |E|. Even the 
study of the second group is far from being complete: polygonal lines, closed 
curves, notions of inside and outside still have to be introduced and treated. In 
the following groups, Hilbert introduces other fundamental notions such as angle 
and figure congruence. To our knowledge, these concepts have never been ad- 
dressed from an intuitionistic point of view, and that would be a good challenge 
for the future. 
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As to the methodological aspects, the techniques we used to adjust classical 
specifications to the intuitionistic framework could probably be generalized and 
applied in fields other than geometry. 

One may bring into question the advantages of axiomatic methods, like ours, 
with respect to polynomial algebraic methods ESEI- When they are formalized 
and supported by proof assistants such as Coq, the former are as rigorous as 
the latter, as they do not let any exception case of the theorems or lemmas be 
eluded. Moreover, the former allow much easier understanding of the geometri- 
cal reasoning than the latter, which almost completely hide it. Also, axiomatic 
methods are not plagued by round-off issues. These are all good reasons to use 
symbolic approaches in CAD systems |B|. 

Conversely, general interactive proof assistants like Coq provide little help for 
proof discovery, whereas some geometry-dedicated algebraic systems are based 
on efficient mechanization processes 1231 . In order to be competitive, automatic 
geometric provers based on axiomatic methods must probably be guided by 
semantic verification by comparison with a sketch j2|. This way is followed in 
some CAD systems jSj- Unfortunately, it is difficult to imagine systematic ways 
to reveal degenerate cases with these methods. 
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